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TO ALL WHOM IT MAY CONCERN: 

Be it known that We, Lloyd G. Mitchell, Mariano A. Garcia-Blanco, citizens of 
the United States and Madaiah Puttaraju and S. Gary Mansfield, citizens of India and Great 
Britain respectively, residing in the United States, City of Durham, State of North Carolina, 
whose post office addresses are 4500 Highgate Drive, Durham, North Carolina 27713, 12 
Sanderling Court, Durham, North Carolina 27713, 416 Tall Oaks Drive, Durham, North Carolina 
27713 and 1005 Prologue Road, Durham, North Carolina 27712, respectively, have invented an 
improvement in 

METHODS AND COMPOSITIONS FOR USE IN 
SPLICEOSOME MEDIATED RNA TRANS-SPUCTNG IN PLANTS 

of which the following is a 



SPECIFICATION 



The present application is a continuation-in-part of pending application serial 
number 09/158,863 filed September 23, 1998 which is a continuation-in-part of serial number 
09/133,717 filed on August 13, 1998 which is a continuation-in-part of serial number 09/087,233 
filed on May 28, 1998, which is a continuation-in-part of pending application serial number 
08/766,354 filed on December 13, 1996, which claims benefit to provisional application number 
60/008,317 filed on December 15, 1995. 
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The present invention was made with government support under Grant Nos. SBIR 
R43DK56526-01 and SBIR R44DK56526-02. The government has certain rights in the 
invention. 

1. INTRODUCTION 

[0001] The present invention provides methods and compositions for generating novel 

nucleic acid molecules through targeted spliceosomal trans-splicing. The compositions of the 
invention include pre-/raws-splicing molecules (PTMs) designed to interact with a natural target 
precursor messenger RNA molecule (target pre-mRNA) and mediate a trans-splicing reaction 
resulting in the generation of a novel chimeric RNA molecule (chimeric RNA). The PTMs of 
the invention are genetically engineered so as to result in the production of a novel chimeric 
RNA which may itself perform a function, such as inhibiting the translation of the RNA, or that 
encodes a protein that complements a defective or inactive protein in a cell, or encodes a toxin 
which kills specific cells. Generally, the target pre-mRNA is chosen as a target because it is 
expressed within a specific cell type thus providing a means for targeting expression of the novel 
chimeric RNA to a selected cell type. The invention further relates to PTMs that have been 
genetically engineered for the identification of exon/intron boundaries of pre-mRNA molecules 
using an exon tagging method. In addition, PTMs can be designed to result in the production of 
chimeric RNA encoding for peptide affinity purification tags which can be used to purify and 
identify proteins expressed in a specific cell type. The methods of the invention encompass 
contacting the PTMs of the invention with a target pre-mRNA under conditions in which a 
portion of the PTM is trans-spliced to a portion of the target pre-mRNA to form a novel chimeric 
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RNA molecule. The methods and compositions of the invention can be used in cellular gene 
regulation, gene repair and suicide gene therapy for treatment of proliferative disorders such as 
cancer or treatment of genetic, autoimmune or infectious diseases. In addition, the methods and 
compositions of the invention can be used to generate novel nucleic acid molecules in plants 
through targeted splicesomal /raws-splicing. For example, targeted trans-splicing may be used 
to regulate gene expression in plants for treatment of plants diseases, engineering of disease 
resistant plants or expression of desirable genes in plants. The methods and compositions of the 
invention can also be used to map intron-exon boundaries and to identify novel proteins 
expressed in any given cell. 

2. BACKGROUND OF THE INVENTION 
[0002] DNA sequences in the chromosome are transcribed into pre-mRNAs which 

contain coding regions (exons) and generally also contain intervening non-coding regions 
(introns). Introns are removed from pre-mRNAs in a precise process called splicing (Chow et 
al, 1977, Cell 12:1-8; and Berget, S.M. et aL, 1977, Proc. Natl. Acad. Sci. USA 74:3171-3175). 
Splicing takes place as a coordinated interaction of several small nuclear ribonucleoprotein 
particles (snRNP ? s) and many protein factors that assemble to form an enzymatic complex 
known as the spliceosome (Moore et aL, 1993, in The RNA World, R.F. Gestland and J.F. 
Atkins eds. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.); Kramer, 1996, 
Annu. Rev. Biochem., 65:367-404; Staley and Guthrie, 1998, Cell 92:315-326). 
[0003] Pre-mRNA splicing proceeds by a two-step mechanism. In the first step, the 5 f 

splice site is cleaved, resulting in a "free" 5' exon and a lariat intermediate (Moore, MJ. and 
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P.A. Sharp, 1993, Nature 365:364-368). In the second step, the 5 ? exon is ligated to the 3' exon 
with release of the intron as the lariat product. These steps are catalyzed in a complex of small 
nuclear ribonucleoproteins and proteins called the spliceosome. 

[0004] The splicing reaction sites are defined by consensus sequences around the 5 ? and 

3 ! splice sites. The 5 f splice site consensus sequence is AG/GURAGU (where A=adenosine, U = 
uracil, G = guanine, C = cytosine, R = purine and / = the splice site). The 3' splice region 
consists of three separate sequence elements: the branch point or branch site, a polypyrimidine 
tract and the 3' splice consensus sequence (YAG). These elements loosely define a 3 1 splice 
region, which may encompass 100 nucleotides of the intron upstream of the 3' splice site. The 
branch point consensus sequence in mammals is YNYURAC (where N = any nucleotide, Y= 
pyrimidine). The underlined A is the site of branch formation (the BPA = branch point 
adenosine). The 3' splice consensus sequence is YAG/G. Between the branch point and the 
splice site there is usually found a polypyrimidine tract, which is important in mammalian 
systems for efficient branch point utilization and 3' splice site recognition (Roscigno, R., F. et aL, 
1993, J. Biol. Chem. 268:11222-11229). The first YAG trinucleotide downstream from the 
branch point and polypyrimidine tract is the most commonly used 3' splice site (Smith, C.W. et 
aL, 1989, Nature 342:243-247). 

[0005] In most cases, the splicing reaction occurs within the same pre-mRNA molecule, 

which is termed c/s-splicing. Splicing between two independently transcribed pre-mRNAs is 
termed trans-splicing. Traws-splicing was first discovered in trypanosomes (Sutton & 
Boothroyd, 1986, Cell 47:527; Murphy et al, 1986, Cell 47:517) and subsequently in nematodes 
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(Krause & Hirsh, 1987, Cell 49:753); flatworms (Rajkovic et al, 1990, Proc. Nat'l. Acad. Sci. 
USA, 87:8879; Davis et al, 1995, J. Biol. Chem. 270:21813) and in plant mitochondria (Malek 
et al, 1997, Proc. Natl Acad. Sci. USA 94:553). In the parasite Trypanosoma brucei, all 
mRNAs acquire a splice leader (SL) RNA at their 5 1 termini by *raws-splicing. A 5* leader 
sequence is also /raws-spliced onto some genes in Caenorhabditis elegans. This mechanism is 
appropriate for adding a single common sequence to many different transcripts. 
[0006] The mechanism of fraws-splicing, which is nearly identical to that of conventional 

czs-splicing, proceeds via two phosphoryl transfer reactions. The first causes the formation of a 
2 f -5 f phosphodiester bond producing a 'Y' shaped branched intermediate, equivalent to the lariat 
intermediate in c/s-splicing. The second reaction, exon ligation, proceeds as in conventional cis- 
splicing. In addition, sequences at the 3 ! splice site and some of the snRNPs which catalyze the 
fraws-splicing reaction, closely resemble their counterparts involved in c/s-splicing. 
[0007] Traws-splicing may also refer to a different process, where an intron of one pre- 

mRNA interacts with an intron of a second pre-mRNA, enhancing the recombination of splice 
sites between two conventional pre-mRNAs. This type of ^raws-splicing was postulated to 
account for transcripts encoding a human immunoglobulin variable region sequence linked to the 
endogenous constant region in a transgenic mouse (Shimizu et al, 1989, Proc. Nat'l. Acad. Sci. 
USA 86:8020). In addition, /raws-splicing of c-myb pre-RNA has been demonstrated (Vellard, 
M. et al. Proc. Natl Acad. Sci., 1992 89:2511-2515) and more recently, RNA transcripts from 
cloned SV40 ^raws-spliced to each other were detected in cultured cells and nuclear extracts (Eul 
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et al 9 1995, EMBO. J. 14:3226). However, naturally occurring trans-splicing of mammalian 
pre-mRNAs is thought to be an exceedingly rare event. 

[0008] In vitro trans-splicing has been used as a model system to examine the 

mechanism of splicing by several groups (Konarska & Sharp, 1985, Cell 46:165-171 Solnick, 
1985, Cell 42:157; Chiara & Reed, 1995, Nature 375:510; Pasman and Garcia-Blanco, 1996, 
Nucleic Acids Res. 24:1638). Reasonably efficient trans-splicing (30% of c/s-spliced analog) 
was achieved between RNAs capable of base pairing to each other, splicing of RNAs not 
tethered by base pairing was further diminished by a factor of 10. Other in vitro ^raws-splicing 
reactions not requiring obvious RNA-RNA interactions among the substrates were observed by 
Chiara & Reed (1995, Nature 375:510), Bruzik J.P. & Maniatis, T. (1992, Nature 360:692) and 
Bruzik J.P. and Maniatis, T., (1995, Proc. Nat'l. Acad. Sci. USA 92:7056-7059). These reactions 
occur at relatively low frequencies and require specialized elements, such as a downstream 5 1 
splice site or exonic splicing enhancers. 

[0009] In addition to splicing mechanisms involving the binding of multiple proteins to 

the precursor mRNA which then act to correctly cut and join RNA, a third mechanism involves 
cutting and joining of the RNA by the intron itself, by what are termed catalytic RNA molecules 
or ribozymes. The cleavage activity of ribozymes has been targeted to specific RNAs by 
engineering a discrete "hybridization" region into the ribozyme. Upon hybridization to the target 
RNA, the catalytic region of the ribozyme cleaves the target. It has been suggested that such 
ribozyme activity would be useful for the inactivation or cleavage of target RNA in vivo, such as 
for the treatment of human diseases characterized by production of foreign of aberrant RNA. 
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The use of antisense RNA has also been proposed as an alternative mechanism for targeting and 
destruction of specific RNAs. In such instances small RNA molecules are designed to hybridize 
to the target RNA and by binding to the target RNA prevent translation of the target RNA or 
cause destruction of the RNA through activation of nucleases. 

[0010] Until recently, the practical application of targeted trans-splicing to modify 

specific target genes has been limited to group I ribozyme-based mechanisms. Using the 
Tetrahymena group I ribozyme, targeted ^raws-splicing was demonstrated in E. coli. coli 
(Sullenger B.A. and Cech. T.R., 1994, Nature 341:619-622) , in mouse fibroblasts (Jones, J.T. et 
al., 1996, Nature Medicine 2:643-648), human fibroblasts (Phylacton, L.A. et al. Nature Genetics 
18:378-381) and human erythroid precursors (Lan et al, 1998, Science 280:1593-1596). While 
many applications of targeted RNA /raws-splicing driven by modified group I ribozymes have 
been explored, targeted splicing mediated by native mammalian splicing machinery, i.e., 
spliceosomes, has not been previously reported. 

3. SUMMARY OF THE INVENTION 
[0011] The present invention relates to compositions and methods for generating novel 

nucleic acid molecules through spliceosome-mediated targeted frans-splicing. The compositions 
of the invention include pre-rra«s-splicing molecules (hereinafter referred to as "PTMs") 
designed to interact with a natural target pre-mRNA molecule (hereinafter referred to as "pre- 
mRNA") and mediate a spliceosomal /raws-splicing reaction resulting in the generation of a 
novel chimeric RNA molecule (hereinafter referred to as "chimeric RNA"). The methods of the 
invention encompass contacting the PTMs of the invention with a natural target pre-mRNA 
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under conditions in which a portion of the PTM is spliced to the natural pre-mRNA to form a 
novel chimeric RNA. The PTMs of the invention are genetically engineered so that the novel 
chimeric RNA resulting from the trans-splicing reaction may itself perform a function such as 
inhibiting the translation of RNA, or alternatively, the chimeric RNA may encode a protein that 
complements a defective or inactive protein in the cell, or encodes a toxin which kills the 
specific cells. Generally, the target pre-mRNA is chosen because it is expressed within a 
specific cell type thereby providing a means for targeting expression of the novel chimeric RNA 
to a selected cell type. The target cells may include, but are not limited to those infected with 
viral or other infectious agents, benign or malignant neoplasms, or components of the immune 
system which are involved in autoimmune disease or tissue rejection. The PTMs of the 
invention may also be used to correct genetic mutations found to be associated with genetic 
diseases. In particular, double-/ra«5-splicing reactions can be used to replace internal exons. 
The PTMs of the invention can also be genetically engineered to tag exon sequences in a mRNA 
molecule as a method for identifying intron/exon boundaries in target pre-mRNA. The invention 
further relates to the use of PTM molecules that are genetically engineered to encode a peptide 
affinity purification tag for use in the purification and identification of proteins expressed in a 
specific cell type. The methods and compositions of the invention can be used in gene 
regulation, gene repair and targeted cell death. Such methods and compositions can be used for 
the treatment of various diseases including, but not limited to, genetic, infectious or autoimmune 
diseases and proliferative disorders such as cancer and to regulate gene expression in plants. 
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4. BRIEF DESCRIPTION OF THE DRAWINGS 
[0012] Figure 1 A. Model of Pre-7ra«s-splicing RNA. 

[0013] Figure IB. Model PTM constructs and targeted *ra«s-splicing strategy. 

Schematic representation of the first generation PTMs (PTM+Sp and PTM-Sp). BD, binding 
domain; NBD, non-binding domain; BP, branch point; PPT, pyrimidine tract; ss, splice site and 
DT-A, diphtheria toxin subunit A. Unique restriction sites within the PTMS are indicated by 
single letters: E; EcoRI; X, Xhol; K, Kpnl; P, Pstl; A, Accl; B, BamHI and H; Hindlll. 
[0014] Figure 1C. Schematic drawing showing the binding of PTM+Sp via conventional 

Watson Crick base pairing to the (3HCG6 target pre-mRNA and the proposed cis- and trans- 
splicing mechanism. 

[0015] Figure 2A. In vitro frans-splicing efficiency of various PTM constructs into 

PHCG6 target. A targeted binding domain and active splice sites correlate with PTM trans- 
splicing activity. Full length targeted (pcPTM+Sp), non-targeted (PTM-Sp) and the splice 
mutants [Py(-)AG(-) and BP(-)Py(-)AG(-)] PTM RNAs were added to splicing reactions 
containing pHCG6 target pre-mRNA. The products were RT-PCR amplified using primers 
pHCG-F (specific for target pHCG6 exon 1) and DT-5R (complementary to DT-A) and analyzed 
by electrophoresis in a 1.5% agarose gel. 

[0016] Figure 2B. In vitro /ra/w-splicing efficiency of various PTM constructs. Full 

length PTM with a spacer between the binding domain and splice site (PTM+Sp), PTM without 
the spacer region (PTM+) and short PTMs that contain a target binding domain (short PTM+) or 
a non-target binding region (PTM-) were added to splicing reactions containing pHCG target 
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pre-mRNA. The products were RT-PCR amplified using primers pHCG-F and DT-3. For 
reactions containing the short PTMs, the reverse PCR primer was DT-4, since the binding site 
for DT-3 was removed from the PTM. 

[0017] Figure 3. Nucleotide sequence demonstrating the in vitro trans-spliced product 

between a PTM and target pre-mRNA (SEQ ID NO: 53), The 466 bp trans-spliced RT-PCR 
product from Figure 2 (lane 2) was re-amplified using a 5* biotin labeled forward primer (PHCG- 
F) and a nested unlabeled reverse primer (DT-3R). Single stranded DNA was purified and 
sequenced directly using toxin specific DT-3R primer. The arrow indicates the splice junction 
between the last nucleotide of target (3HCG6 exon I and the first nucleotide encoding DT-A. 
[0018] Figure 4A. Schematic diagram of the "safety" PTM and variations, 

demonstrating the PTM intramolecular base-paired stem, intended to mask the BP and PPT from 
splicing factors (SEQ ED NOS: 54, 55, 56). Underlined sequences represent the pHCG6 intron 1 
complementary target-binding domain, sequence in italics indicate target mismatches that are 
homologous to the BP. 

[0019] Figure 4B. Schematic of a safety PTM in open configuration upon binding to the 

target. 

[0020] Figure 4C. In vitro trans-splicing reactions were carried out by incubating either 

safety PTM or safety PTM variants with the (3HCG6 target. Splicing reactions were amplified 
by RT-PCR using pHCG-F and DT-3R primers; products were analyzed in a 2.0% agarose gel. 
[0021] Figure 5. Specificity of targeted trans-splicing is enhanced by the inclusion of a 

safety into the PTM. PHCG6 pre-mRNA (250 ng) and p-globin pre-mRNA (250 ng) were 
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annealed together with either PTM+SF (safety) or pcPTM+Sp (linear) RNA (500 ng). In vitro 
/raws-splicing reactions and RT-PCR analysis were performed as described under experimental 
procedures and the products were separated on a 2.0% agarose gel. Primers used for RT-PCR 
are as indicated. 

[0001] Figure 6. In the presence of increasing PTM concentration, cz's-splicing is 

inhibited and replaced by fraws-splicing. In vitro splicing reactions were performed in the 
presence of a constant amount of pHCG6 target pre-mRNA (100 ng) with increasing 
concentrations of PTM (pcPTM+Sp) RNA (52-300 ng). RT-PCR for ds-spliced and un-spliced 
products utilized primers (3HCG-F (exon 1 specific) and pHCG-R2 (exon 2 specific - Panel A); 
primers pHCG-F and DT-3R were used to RT-PCR trans-spliced products (Panel B). Reaction 
products were analyzed on 1.5% and 2.0% agarose gels, respectively. In panel A, lane 9 
represents the 60 min time point in the presence of 300 ng of PTM, which is equivalent to lane 
10 in panel B. 

[0002] Figure 7A. PTMs are capable of /raw-splicing in cultured human cancer cells. 

Total RNA was isolated from each of 4 expanded neomycin resistant HI 299 lung carcinoma 
colonies transfected with pcSp+CRM (expressing non-toxic mutant DT-A) RT-PCR was 
performed using 1 jig of total RNA and 5' biotinylated pHCG-F and non-biotinylated DT-3R 
primers. Single stranded DNA was purified and sequenced. 

[0003] Figure 7B. Nucleotide sequence (sense strand) (SEQ ID NO:l) of the trans- 

spliced product between endogenous pHCG6 target and CRM 197 mutant toxin is shown (SEQ 
ID NO: 57). Two arrows indicate the position of the splice junction. 
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[0004] Figure 8 A. Schematic diagram of a double splicing pre-therapeutic mRNA. 

[0005] Figure 8B. Selective fraws-splicing of a double splicing PTM. By varying the 

PTM concentration the PTM can be fraws-spliced into either the 5' or the 3' splice site of the 
target. 

[0006] Figure 9. Schematic diagram of the use of PTM molecules for exon tagging. 

Two examples of PTMs are shown. The PTM on the left is capable of non-specifically trans- 
splicing into a target pre-mRNA 3 1 splice site. The other PTM on the right is designed to non- 
specifically trans-splice into a target pre-mRNA 5 1 splice site. A PTM mediated trans-splicing 
reaction will result in the production of a chimeric RNA comprising a specific tag to either the 5' 
or 3' side of an authentic exon. I 
[0007] Figure 10A. Schematic diagram of constructs for use in the lacZ knock-out 

model. The target lacZ pre-mRNA contains the 5' fragment of lacZ (SEQ ID NO: 58 and SEQ 
ID NO: 67) followed by 0HCG6 intron 1 (SEQ ID NO: 59 and SEQ ID NO: 68) and the 3' 
fragment of lacZ (SEQ ID NO: 60)(target 1). The PTM molecule for use in the model system 
was created by digesting pPTM +SP with PstI and Hindlll and replacing the DT-A toxin with 
[3HCG6 exon 2 (pc3.1PTM2). 

[0008] Figure 10B. Schematic diagram of restoration of p-Gal activity by Spliceosome 

Mediated RNA Trans-splicing. Schematic diagram of constructs for use in the lacZ knock-in 
model (pc.3.1 lacZ T2). The lacZ target pre-mRNA is identical to that target pre-mRNA used 
for the knock-out experiments except that it contains two stop codons (TAA TAA) in frame four 
codons after the 3 1 splice site. The PTM molecule for use in the model system was created by 
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digesting pPTM +SP with PstI and Hindlll and replacing the DT-A toxin with functional 3' 
fragment of lacZ. 

[0009] Figure 11 A. Demonstration of cis-md trans-splicing when utilizing the lacZ 

knock-out model. The LacZ splice target 1 pre-mRNA and PTM2 were co-transfected into 293T 
cells. Total RNA was then isolated and analyzed by PCR for c/s-spliced and trans-spliced 
products using the appropriate specific primers. The amplified PCR products were separated on 
a 2% agarose gel. 

[0010] Figure 11B-C. Assays for p-galactosidase activity. 293 cells were transfected 

with lacZ target 2 DNA alone (panel B) or lacZ target 2 DNA and PTM1 (panel C). 
[0011] Figure 12 A. Nucleotide sequence of trans-spliced molecule demonstrating 

accurate trans-splicing (SEQ ID NO: 61). 

[0012] Figure 12B. Nucleotide sequences of the czs-spliced product and the trans-spliced 

product (SEQ ID NOS: 62, 63). The nucleotide sequences were those sequences expected for 
each of the different splicing reactions. 

[0013] Figure 13. Gene repair model for repair of the cystic fibrosis transmembrane 

regulator (CFTR) gene. 

[0014] Figure 14. RT-PCR demonstration of trans-splicing between an exogenously 

supplied CFTR mini-gene target and PTM. Plasmids were co-transfected into 293 embryonic 
kidney cells. The primers pairs used for RT-PCR reactions are listed above each lane. The 
lower band (471 bp) in each lane represents a trans-spliced product. The lower band in lane 1 
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(471bp) was purified from a 2% Seakem agarose gel and the DNA sequence of the band was 
determined. 

[0015] Figure 15. DNA sequence of the trans-spliced product (lane 1, lower band shown 

in Figure 14) (SEQ ID NO: 64). The DNA sequence indicates the presence of the F508 codon 
(CTT), exon 9 sequence is contiguous with exon 10 sequence, and the His tag sequence. 
[0016] Figure 16. Schematic representation of repair of an exogenously supplied CFTR 

target molecule carrying an F508 deletion in exon 10. 

[0017] Figure 17. Repair of endogenous CFTR transcripts by exon 10 replacement using 

a double splicing PTM. The use of a double splicing PTM permits repair of the a 5 08 mutation 
with a very short PTM molecule. 

[0018] Figure 18. Model lacZ target consisting of lacZ 5' exon - CFTR mini-intron 9 - 

CFTR exon 10 (delta 508) - CFTR mini-intron 10 followed by the lacZ 3' exon. Binding 
domains for PTMs are bracketed. 

[0019] Figure 19. Schematic representation of double-frans-splicing PTMs designed to 

restore (3-gal function. 

[0020] Figure 20. Schematic representation of a double-^rcws-splicing reaction showing 

the binding of DSPTM7 with DSCFT1.6 target pre-mRNA. 

[0021] Figure 21. Important structural elements of DSPTM7. The double splicing PTM 

has both 3* and 5* functional splice sites as well as binding domains. 

[0022] Figure 22. Schematic diagram of mutant double splicing PTMs (SEQ ID NO:85). 

[0023] Figure 23. Accuracy of double-frans-splicing reaction (SEQ ID NOS:86, 87). 
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[0024] Figure 24. Double-fra^s-splicing between the target pre-mRNA and the 

DSPTM7 produces full-length protein. Western blot analysis of total cell lysates using 
polyclonal anti-p-galactosidase antiserum. 

[0025] Figure 25. Precise internal exon substitution between the DSCFT1.6 target pre- 

mRNA and DSPTM7 RNA by double-/raws-splicing produces functionally active P-gal protein. 
Total cell extracts were prepared and assayed for p-gal activity using an ONPG assay. 
[0026] Figure 26. 3' and 5' splice sites are essential for the restoration of P-gal function 

by double-fraws-splicing reaction. 

[0027] Figure 27. Double-^rara-splicing: titration of target and PTM. Different 

concentrations of the target and PTM were co-transfected and analyzed for P-gal activity 
restoration. 

[0028] Figure 28. Constructs designed to test the specificity of double-fraws-splicing 

reaction. 

[0029] Figure 29. Specificity of a double-?ra«s-splicing reaction. 

[0030] Figure 30. Trans-splicing repair of the cystic fibrosis gene using a PTM that 

mediates a double-fra/w-splicing event. 

[0031] Figure 31. PTM with a long binding domain masking two splice sites and part of 

exon 10 in a mini-gene target (SEQ ID NO:83). 

[0032] Figure 32. Sequence of a single PCR product showing target exon 9 correctly 

spliced to PTM exon 10 (with modified codons) (upper panel) (SEQ ID NO:89), codon 508 in 
exon 10 of the PTM (middle panel)(SEQ ID NO:90) and PTM exon 10 correctly spliced to target 
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exon 11 (lower panel)(SEQ ID NO:91). The sequence of a repaired target was generated by 
RT-PCR followed by PCR. 

[0033] Figure 33. Trans-splicing repair of the cystic fibrosis gene using a PTM that can 

perform 5 ! exon replacement. 

[0034] Figure 34. Schematic diagram of three different PTM molecules with different 

binding domains. 

[0035] Figure 35. Schematic diagram of PTM exon 10 with modified codon usage to 

reduce antisense effects with its own binding domain (SEQ ID NO:92). 

[0036] Figure 36. Sequence of cis- and /rara-spliced products (SEQ ID NOS:93, 94, 95, 

96, 97). 

[0037] Figure 37. Model system for repair of messenger RNAs by /rans-splicing. 

(A) Schematic illustration of a defective lacZCF9m splice target used in the present study (see 
Materials and Methods for details). BP, branch point; PPT, polypyrimidine tracts; ss, splice sites 
and pA, polyadenylation signal (SEQ ID NO:98, 99) . (B) A prototype PTM showing the key 
components of the fraws-splicing domain (SEQ ID NO: 100), and the diagrams of various PTMs 
showing the binding domain length and approximate positions at which they bind to the target 
pre-mRNA. Unique restriction sites within the frans-splicing domain are N, Nhe I; S, Sac II; K, 
Kpn I and E, EcoR V. (C) Schematic diagram showing the binding of a PTM through antisense 
binding and repair of defective lacZ pre-mRNA through targeted RNA frans-splicing. Expected 
cis and /raws-spliced products and the primer binding sites for Lac-9F, Lac-3R and Lac-5R are 
indicated. 
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[0038] Figure 38. Efficient repair of lacZ messenger RNA. Target specific primers, Lac- 

9F (5' exon) and Lac-3R (3' exon) were used to amplify ds-spliced products (lanes 1-6), while; 
target and PTM specific primers, Lac-9F (5' exon) and Lac-5R (3 1 exon) were used to amplify 
fraws-spliced products (lanes 7-15). 25-50 ng of total RNA was used to measure target cis- 
splicing (lanes 1-6) and 50-200 ng of total RNA was used to measure PTM induced RNA trans- 
splicing (lanes 7-12). Lanes 13-15, 25-50 ng of total RNA from cells transfected with lacZCF9 a 
control for trans-splicing, (B) Endogenous mRNA repair by trans-splicing. Lanes 1-3, RNA 
from cells transfected with PTM-CF14; lanes 4-6, PTM-CF22 and lanes 7-9, PTM-CF24. Lane 
10, RNA from mock-transfected cells and lane 11 is a control in which reverse-transcription 
reaction was omitted. 

[0039] Figure 39. Messenger RNA repair leads to synthesis of full-length 

p-galactosidase. Lane 1, lacZCF9 (positive control, 5 /ug); lane 2, lacZCF9m target alone 
(25 jLtg); lane 3, PTM-CF24 alone (25 //g) and lane 4, lacZCF9m target + PTM-CF24 (25 //g). 
[0040] Figure 40. Messenger RNA repair by SMaRT produces functional 

P-galactosidase. (A) In situ detection of functional p-galactosidase produced by trans-splicing. 
293T cells were either transfected (transient assay) with lacZCF9m target alone (panel A) or co- 
transfected with lacZCF9m target + PTM-CF24 (panel B) expression plasmids as described 
above. 48-hr post-transfection, cells were rinsed with PBS and stained in situ for p-gal activity. 
(B) Repair of a defective lacZ mRNA produces functional p-galactosidase. Target and PTM, 
extracts from cells transfected with either lacZCF9m target or PTM-CF24 plasmid alone, and the 
rest were from cells co-transfected with lacZCF9m target and one of the PTMs as indicated. (C) 
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Endogenous mRNA repair by trans-splicing produces functional p-galactosidase. Stable cells 
expressing an endogenous lacZCF9m pre-mRNA target was transfected with "linear" PTMs 
(PTM-CF14, PTM-CF22 or PTM-CF24) as described above. Following transfection, total cell 
lysate was prepared and assayed for P-gal activity. The results presented are the average of two 
independent transfections. 

[0041] Figure 41. Messenger RNA repair is specific. (A) Experimental strategy to 

measure non-specific trans-splicing between lacZHCGlm pre-mRNA and "linear" PTMs. (B) 
Extended binding domains enhance the specificity of trans-splicing. Lanes 1-3, PTM-CF14; 4-6, 
PTM-CF22; 7-9, PTM-CF24; 10-12, PTM-CF26 and 13-15, PTM-CF27. (C) PTMs with very 
long binding domains are capable of increasing specificity. Total cell extract (5 was assayed 
in solution for P-gal activity and the specific activity was calculated, p-gal activity was 
normalized to mock and the results presented are the average of two independent transfections. 
Control, extract from cells transfected with lacZHCGlm target alone and the rest were co- 
transfected with lacZHCGlm target and one of the linear PTMs. 

[0042] Figure 42. Complete sequence of CFTR PTM 30 (5* exon replacement PTM) 

showing the trans-splicing domain (underlined) (SEQ ID NO: 102) and the coding sequence for 
exons 1-10 of the CFTR gene (SEQ ID NO:101) . Modified codons in exon 10 are underlined 
and bold. 

[0043] Figure 43A. 153 base-pair PTM 24 Binding Domain (SEQ ID NO:103). 

[0044] Figure 43B. Complete sequence of CFTR PTM 24 (3* exon replacement PTM) 

showing the trans-splicing domain (underlined) (SEQ ID NO: 104) and the coding sequence for 
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exons 10-24 of the CFTR cDNA (SEQ ID NO: 105). At the end of the coding is a histidine tag 
and the translation stop codon. 

5. DETAILED DESCRIPTION OF THE INVENTION 
[0045] The present invention relates to compositions comprising pre-fraws-splicing 

molecules (PTMs) and the use of such molecules for generating novel nucleic acid molecules. 
The PTMs of the invention comprise one or more target binding domains that are designed to 
specifically bind to pre-mRNA, a 3' splice region that includes a branch point, pyrimidine tract 
and a 3' splice acceptor site and/or a 5 f splice donor site; and one or more spacer regions that 
separate the RNA splice site from the target binding domain. In addition, the PTMs of the 
invention can be engineered to contain any nucleotide sequences such as those encoding a 
translatable protein product. 

[0046] The methods of the invention encompass contacting the PTMs of the invention 

with a natural pre-mRNA under conditions in which a portion of the PTM is /ra«s-spliced to a 
portion of the natural pre-mRNA to form a novel chimeric RNA. The target pre-mRNA is 
chosen as a target due to its expression within a specific cell type thus providing a mechanism 
for targeting expression of a novel RNA to a selected cell type. The resulting chimeric RNA 
may provide a desired function, or may produce a gene product in the specific cell type. The 
specific cells may include, but are not limited to those infected with viral or other infectious 
agents, benign or malignant neoplasms, or components of the immune system which are 
involved in autoimmune disease or tissue rejection. Specificity is achieved by modification of 
the binding domain of the PTM to bind to the target endogenous pre-mRNA. The gene products 
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encoded by the chimeric RNA can be any gene, including genes having clinical usefulness, for 
example, therapeutic or marker genes, and genes encoding toxins. 

5.1. STRUCTURE OF THE PRE-77L4Aff-SPLICING MOLECULES 
[0047] The present invention provides compositions for use in generating novel chimeric 

nucleic acid molecules through targeted trans-splicing. The PTMs of the invention comprise (i) 
one or more target binding domains that targets binding of the PTM to a pre-mRNA (ii) a 3' 
splice region that includes a branch point, pyrimidine tract and a 3* splice acceptor site and/or 5' 
splice donor site; and (iii) one or more spacer regions to separate the RNA splice site from the 
target binding domain. Additionally, the PTMs can be engineered to contain any nucleotide 
sequence encoding a translatable protein product. In yet another embodiment of the invention, 
the PTMs can be engineered to contain nucleotide sequences that inhibit the translation of the 
chimeric RNA molecule. For example, the nucleotide sequences may contain translational stop 
codons or nucleotide sequences that form secondary structures and thereby inhibit translation. 
Alternatively, the chimeric RNA may function as an antisense molecule thereby inhibiting 
translation of the RNA to which it binds. 

[0048] The target binding domain of the PTM may contain multiple binding domains 

which are complementary to and in anti-sense orientation to the targeted region of the selected 
pre-mRNA. As used herein, a target binding domain is defined as any sequence that confers 
specificity of binding and anchors the pre-mRNA closely in space so that the spliceosome 
processing machinery of the nucleus can trans-splice a portion of the PTM to a portion of the 
pre-mRNA. The target binding domains may comprise up to several thousand nucleotides. In 
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preferred embodiments of the invention the binding domains may comprise at least 10 to 30 and 
up to several hundred nucleotides. As demonstrated herein, the specificity of the PTM can be 
increased significantly by increasing the length of the target binding domain. For example, the 
target binding domain may comprise several hundred nucleotides or more. In addition, although 
the target binding domain may be "linear 1 ' it is understood that the RNA may fold to form 
secondary structures that may stabilize the complex thereby increasing the efficiency of splicing. 
A second target binding region may be placed at the 3 ! end of the molecule and can be 
incorporated into the PTM of the invention. Absolute complementarity, although preferred, is 
not required. A sequence "complementary" to a portion of an RNA, as referred to herein, means 
a sequence having sufficient complementarity to be able to hybridize with the RNA, forming a 
stable duplex. The ability to hybridize will depend on both the degree of complementarity and 
the length of the nucleic acid (See, for example, Sambrook et al., 1989, Molecular Cloning, A 
Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York). Generally, the longer the hybridizing nucleic acid, the more base mismatches with an 
RNA it may contain and still form a stable duplex. One skilled in the art can ascertain a tolerable 
degree of mismatch or length of duplex by use of standard procedures to determine the stability 
of the hybridized complex. 

[0049] Where the PTMs are designed for use in intron-exon tagging or for peptide 

affinity tagging, a library of PTMs is genetically engineered to contain random nucleotide 
sequences in the target binding domain. Alternatively, for intron-exon tagging the PTMs may be 
genetically engineered so as to lack target binding domains. The goal of generating such a 
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library of PTM molecules is that the library will contain a population of PTM molecules capable 
of binding to each RNA molecule expressed in the cell. A recombinant expression vector can be 
genetically engineered to contain a coding region for a PTM including a restriction endonuclease 
site that can be used for insertion of random DNA fragments into the PTM to form random target 
binding domains. The random nucleotide sequences to be included in the PTM as target binding 
domains can be generated using a variety of different methods well known to those of skill in the 
art, including but not limited to, partial digestion of DNA with restriction enzymes or mechanical 
shearing of DNA to generate random fragments of DNA. Random binding domain regions may 
also be generated by degenerate oligonucleotide synthesis. The degenerate oligonucleotides can 
be engineered to have restriction endonuclease recognition sites on each end to facilitate cloning 
into a PTM molecule for production of a library of PTM molecules having degenerate binding 
domains. 

[0050] Binding may also be achieved through other mechanisms, for example, through 

triple helix formation or protein/nucleic acid interactions such as those in which the PTM is 
engineered to recognize a specific RNA binding protein, i.e., a protein bound to a specific target 
pre-mRNA. Alternatively, the PTMs of the invention may be designed to recognize secondary 
structures, such as for example, hairpin structures resulting from intramolecular base pairing 
between nucleotides within an RNA molecule. 

[0051] The PTM molecule also contains a 3 ! splice region that includes a branch point, 

pyrimidine tract and a 3* splice acceptor AG site and/or a 5' splice donor site. Consensus 
sequences for the 5' splice donor site and the 3' splice region used in RNA splicing are well 
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known in the art (See, Moore, et al, 1993, The RNA World, Cold Spring Harbor Laboratory 
Press, p. 303-358). In addition, modified consensus sequences that maintain the ability to 
function as 5' donor splice sites and 3' splice regions may be used in the practice of the 
invention. Briefly, the 5* splice site consensus sequence is AG/GURAGU (where A=adenosine, 
U=uracil, G=guanine, C=cytosine, R=purine and /=the splice site). The 3 f splice site consists of 
three separate sequence elements: the branch point or branch site, a polyp yrimidine tract and the 
3' consensus sequence (YAG). The branch point consensus sequence in mammals is 
YNYURAC (Y=pyrimidine). The underlined A is the site of branch formation. A 
polypyrimidine tract is located between the branch point and the splice site acceptor and is 
important for different branch point utilization and 3 f splice site recognition. 
[0052] Further, PTMs comprising a 3' acceptor site (AG) may be genetically engineered. 

Such PTMs may further comprise a pyrimidine tract and/or branch point sequence. 
[0053] Recently, pre-messenger RNA introns beginning with the dinucleotide AU and 

ending with the dinucleotide AC have been identified and referred to as U12 introns. U12 intron 
sequences as well as any sequences that function as splice acceptor/donor sequences may also be 
used in PTMs. 

[0054] A spacer region to separate the RNA splice site from the target binding domain is 

also included in the PTM. The spacer region can have features such as stop codons which would 
block any translation of an unspliced PTM and/or sequences that enhance trans-splicing to the 
target pre-mRNA. 
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[0055] In a preferred embodiment of the invention, a "safety" is also incorporated into 

the spacer, binding domain, or elsewhere in the PTM to prevent non-specific trans-splicing. 
This is a region of the PTM that covers elements of the 3' and/or 5' splice site of the PTM by 
relatively weak complementarity, preventing non-specific trans-splicing. The PTM is designed 
in such a way that upon hybridization of the binding /targeting portion(s) of the PTM, the 3' 
and/or 5 'splice site is uncovered and becomes fully active. 

[0056] The "safety" consists of one or more complementary stretches of m-sequence (or 

could be a second, separate, strand of nucleic acid) which weakly binds to one or both sides of 
the PTM branch point, pyrimidine tract, 3' splice site and/or 5 f splice site (splicing elements), or 
could bind to parts of the splicing elements themselves. This "safety" binding prevents the 
splicing elements from being active (i.e. block U2 snRNP or other splicing factors from attaching 
to the PTM splice site recognition elements). The binding of the "safety" may be disrupted by 
the binding of the target binding region of the PTM to the target pre-mRNA, thus exposing and 
activating the PTM splicing elements (making them available to trans-splice into the target pre- 
mRNA). 

[0057] A nucleotide sequence encoding a translatable protein capable of producing an 

effect, such as cell death, or alternatively, one that restores a missing function or acts as a 
marker, is included in the PTM of the invention. For example, the nucleotide sequence can 
include those sequences encoding gene products missing or altered in known genetic diseases. 
Alternatively, the nucleotide sequences can encode marker proteins or peptides which may be 
used to identify or image cells. In yet another embodiment of the invention nucleotide sequences 
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encoding affinity tags such as, HIS tags (6 consecutive histidine residues) (Janknecht, et al., 
1991, Proc. Natl. Acad. Sci. USA 88:8972-8976), the C-terminus of glutathione-S-transferase 
(GST) (Smith and Johnson, 1986, Proc. Natl. Acad. Sci. USA 83:8703-8707) (Pharmacia) or 
FLAG (Asp-Tyr-Lys-Asp-Asp-Asp-Lys) (SEQ ID NO: 66) (Eastman Kodak/IBI, Rochester, 
NY) can be included in PTM molecules for use in affinity purification. The use of PTMs 
containing such nucleotide sequences results in the production of a chimeric RNA encoding a 
fusion protein containing peptide sequences normally expressed in a cell linked to the peptide 
affinity tag. The affinity tag provides a method for the rapid purification and identification of 
peptide sequences expressed in the cell. In a preferred embodiment the nucleotide sequences 
may encode toxins or other proteins which provide some function which enhances the 
susceptibility of the cells to subsequent treatments, such as radiation or chemotherapy. 
[0058] In a highly preferred embodiment of the invention a PTM molecule is designed to 

contain nucleotide sequences encoding the Diphtheria toxin subunit A (Greenfield, L., et al., 
1983, Proc. Natl Acad. Sci. USA 80: 6853-6857). Diphtheria toxin subunit A contains 
enzymatic toxin activity and will function if expressed or delivered into human cells resulting in 
cell death. Furthermore, various other known peptide toxins may be used in the present 
invention, including but not limited to, ricin, Pseudomonus toxin, Shiga toxin and exotoxin A. 
[0059] Additional features can be added to the PTM molecule either after, or before, the 

nucleotide sequence encoding a translatable protein, such as polyadenylation signals or 5' splice 
sequences to enhance splicing, additional binding regions, "safety"-self complementary regions, 
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additional splice sites, or protective groups to modulate the stability of the molecule and prevent 
degradation. 

[0060] Additional features that may be incorporated into the PTMs of the invention 

include stop codons or other elements in the region between the binding domain and the splice 
site to prevent unspliced pre-mRNA expression. In another embodiment of the invention, PTMs 
can be generated with a second anti-sense binding domain downstream from the nucleotide 
sequences encoding a translatable protein to promote binding to the 3 f target intron or exon and 
to block the fixed authentic cis-5' splice site (U5 and/or Ul binding sites). 
[0061] PTMs may also be generated that require a double-fraws-splicing reaction for 

generation of a chimeric trans-spliced product. Such PTMs could be used to replace an internal 
exon which could be used for RNA repair. PTMs designed to promote two trans-splicing 
reactions are engineered as described above, however, they contain both 5' donor sites and 3' 
splice acceptor sites. In addition, the PTMs may comprise two or more binding domains and 
splicer regions. The splicer regions may be place between the multiple binding domains and 
splice sites or alternatively between the multiple binding domains. 

[0062] Further elements such as a 3' hairpin structure, circularized RNA, nucleotide base 

modification, or a synthetic analog can be incorporated into PTMs to promote or facilitate 
nuclear localization and spliceosomal incorporation, and intra-cellular stability. 
[0063] Additionally, when engineering PTMs for use in plant cells it may not be 

necessary to include conserved branch point sequences or polypyrimidine tracts as these 
sequences may not be essential for intron processing in plants. However, a 3' splice acceptor site 
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and/or 5 1 splice donor site, such as those required for splicing in vertebrates and yeast, will be 
included. Further, the efficiency of splicing in plants may be increased by also including UA- 
rich intronic sequences. The skilled artisan will recognize that any sequences that are capable of 
mediating a trans-splicing reaction in plants may be used. 

[0064] The PTMs of the invention can be used in methods designed to produce a novel 

chimeric RNA in a target cell. The methods of the present invention comprise delivering to the 
target cell a PTM which may be in any form used by one skilled in the art, for example, an RNA 
molecule, or a DNA vector which is transcribed into a RNA molecule, wherein said PTM binds 
to a pre-mRNA and mediates a trans-splicing reaction resulting in formation of a chimeric RNA 
comprising a portion of the PTM molecule spliced to a portion of the pre-mRNA. 

5.2. SYNTHESIS OF THE TRANS-SPLICING MOLECULES 
[0065] The nucleic acid molecules of the invention can be RNA or DNA or derivatives or 

modified versions thereof, single-stranded or double-stranded. By nucleic acid is meant a PTM 
molecule or a nucleic acid molecule encoding a PTM molecule, whether composed of 
deoxyribonucleotides or ribonucleosides, and whether composed of phosphodiester linkages or 
modified linkages. The term nucleic acid also specifically includes nucleic acids composed of 
bases other than the five biologically occurring bases (adenine, guanine, thymine, cytosine and 
uracil). 

[0066] The RNA and DNA molecules of the invention can be prepared by any method 

known in the art for the synthesis of DNA and RNA molecules. For example, the nucleic acids 
may be chemically synthesized using commercially available reagents and synthesizers by 
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methods that are well known in the art (see, e.e., Gait, 1985, Oligonucleotide Synthesis: A 
Practical Approach, IRL Press, Oxford, England). Alternatively, RNA molecules can be 
generated by in vitro and in vivo transcription of DNA sequences encoding the RNA molecule. 
Such DNA sequences can be incorporated into a wide variety of vectors which incorporate 
suitable RNA polymerase promoters such as the T7 or SP6 polymerase promoters. RNAs may 
be produced in high yield via in vitro transcription using plasmids such as SPS65 (Promega 
Corporation, Madison, WI). In addition, RNA amplification methods such as Q-p amplification 
can be utilized to produce RNAs. 

[0067] The nucleic acid molecules can be modified at the base moiety, sugar moiety, or 

phosphate backbone, for example, to improve stability of the molecule, hybridization, transport 
into the cell, etc. For example, modification of a PTM to reduce the overall charge can enhance 
the cellular uptake of the molecule. In addition modifications can be made to reduce 
susceptibility to nuclease degradation. The nucleic acid molecules may include other appended 
groups such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et aL, 1989, Proc. Natl. Acad. Sci. U.S.A. 
86:6553-6556; Lemaitre et aL, 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. 
W088/09810, published December 15, 1988) or the blood-brain barrier (see, e.g., PCT 
Publication No. W089/10134, published April 25, 1988), hybridization-triggered cleavage 
agents. (See, e.g., Krol et aL, 1988, BioTechniques 6:958-976) or intercalating agents. (See, 
e.g., Zon, 1988, Pharm. Res. 5:539-549). To this end, the nucleic acid molecules may be 
conjugated to another molecule, e.g., a peptide, hybridization triggered cross-linking agent, 
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transport agent, hybridization-triggered cleavage agent, etc. Various other well-known 
modifications to the nucleic acid molecules can be introduced as a means of increasing 
intracellular stability and half-life. Possible modifications include, but are not limited to, the 
addition of flanking sequences of ribo- or deoxy- nucleotides to the 5' and/or 3' ends of the 
molecule. In some circumstances where increased stability is desired, nucleic acids having 
modified internucleoside linkages such as 2 ! -0-methylation may be preferred. Nucleic acids 
containing modified internucleoside linkages may be synthesized using reagents and methods 
that are well known in the art (see, Uhlmann et al 9 1990, Chem. Rev. 90:543-584; Schneider et 
al, 1990, Tetrahedron Lett. 31:335 and references cited therein). 

[0068] The nucleic acids may be purified by any suitable means, as are well known in the 

art. For example, the nucleic acids can be purified by reverse phase chromatography or gel 
electrophoresis. Of course, the skilled artisan will recognize that the method of purification will 
depend in part on the size of the nucleic acid to be purified. 

[0069] In instances where a nucleic acid molecule encoding a PTM is utilized, cloning 

techniques known in the art may be used for cloning of the nucleic acid molecule into an 
expression vector. Methods commonly known in the art of recombinant DNA technology which 
can be used are described in Ausubel et al (eds.), 1993, Current Protocols in Molecular Biology, 
John Wiley & Sons, NY; and Kriegler, 1990, Gene Transfer and Expression, A Laboratory 
Manual, Stockton Press, NY. 

[0070] The DNA encoding the PTM of interest may be recombinantly engineered into a 

variety of host vector systems that also provide for replication of the DNA in large scale and 
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contain the necessary elements for directing the transcription of the PTM. The use of such a 
construct to transfect target cells in the patient will result in the transcription of sufficient 
amounts of PTMs that will form complementary base pairs with the endogenously expressed pre- 
mRNA targets and thereby facilitate a trans-splicing reaction between the complexed nucleic 
acid molecules. For example, a vector can be introduced in vivo such that it is taken up by a cell 
and directs the transcription of the PTM molecule. Such a vector can remain episomal or 
become chromosomally integrated, as long as it can be transcribed to produce the desired RNA. 
Such vectors can be constructed by recombinant DNA technology methods standard in the art. 
[0071] Vectors encoding the PTM of interest can be plasmid, viral, or others known in 

the art, used for replication and expression in mammalian cells. Expression of the sequence 
encoding the PTM can be regulated by any promoter known in the art to act in mammalian, 
preferably human cells. Such promoters can be inducible or constitutive. Such promoters 
include but are not limited to: the SV40 early promoter region (Benoist, C. and Chambon, P. 
1981, Nature 290:304-310), the promoter contained in the 3' long terminal repeat of Rous 
sarcoma virus (Yamamoto et al, 1980, Cell 22:787-797), the herpes thymidine kinase promoter 
(Wagner et al. 9 1981, Proc. Natl. Acad. Sci. U.S.A. 78:14411445), the regulatory sequences of 
the metallothionein gene (Brinster et al, 1982, Nature 296:39-42), the viral CMV promoter, the 
human chorionic gonadotropin- p promoter (Hollenberg et al., 1994, Mol. Cell. Endocrinology 
106: 1 1 1-1 19), etc. Any type of plasmid, cosmid, YAC or viral vector can be used to prepare the 
recombinant DNA construct which can be introduced directly into the tissue site. Alternatively, 
viral vectors can be used which selectively infect the desired target cell. 
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[0072] For use of PTMs encoding peptide affinity purification tags, it is desirable to 

insert nucleotide sequences containing random target binding sites into the PTMs and clone them 
into a selectable mammalian expression vector system. A number of selection systems can be 
used, including but not limited to selection for expression of the herpes simplex virus thymidine 
kinase, hypoxanthine-guanine phosphoribosyltransterase and adenine phosphoribosyl transferase 
protein in tk-, hgprt- or aprt- deficient cells, respectively. Also, anti-metabolic resistance can be 
used as the basis of selection for dihydrofolate tranferase (dhfr), which confers resistance to 
methotrexate; xanthine-guanine phosphoribosyl transferase (gpt) 9 which confers resistance to 
mycophenolic acid; neomycin (neo), which confers resistance to aminoglycoside G-418; and 
hygromycin B phosphotransferase (hygro) which confers resistance to hygromycin. In a 
preferred embodiment of the invention, the cell culture is transformed at a low ratio of vector to 
cell such that there will be only a single vector, or a limited number of vectors, present in any 
one cell. Vectors for use in the practice of the invention include any eukaryotic expression 
vectors, including but not limited to viral expression vectors such as those derived from the class 
of retroviruses or adeno-associated viruses. 

5.3. USES AND ADMINISTRATION OF TRANS-SPUCTNG MOLECULES 

5.3.1. USE OF PTM MOLECULES FOR GENE REGULATION, 
GENE REPAIR AND TARGETED CELL DEATH 

[0073] The compositions and methods of the present invention will have a variety of 

different applications including gene regulation, gene repair and targeted cell death. For 
example, trans-splicing can be used to introduce a protein with toxic properties into a cell. In 
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addition, PTMs can be engineered to bind to viral mRNA and destroy the function of the viral 
mRNA, or alternatively, to destroy any cell expressing the viral mRNA. In yet another 
embodiment of the invention, PTMs can be engineered to place a stop codon in a deleterious 
mRNA transcript thereby decreasing the expression of that transcript. 

[0074] Targeted fraws-splicing, including double-rraws-splicing reactions, 3 f exon 

replacement and/or 5' exon replacement can be used to repair or correct transcripts that are either 
truncated or contain point mutations. The PTMs of the invention are designed to cleave a 
targeted transcript upstream or downstream of a specific mutation or upstream of a premature 3' 
and correct the mutant transcript via a trans-splicing reaction which replaces the portion of the 
transcript containing the mutation with a functional sequence. 

[0075] In addition, double trans-splicing reactions may be used for the selective 

expression of a toxin in tumor cells. For example, PTMs can be designed to replace the second 
exon of the human (5-chronic gonadotropin-6 (phCG6) gene transcripts and to deliver an exon 
encoding the subunit A of diptheria toxin (DT-A). Expression of DT-A in the absence of subunit 
B should lead to toxicity only in the cells expressing the gene. (3hCG6 is a prototypical target for 
genetic modification by trans-splicing. The sequence and the structure of the phCG6 gene are 
completely known and the pattern of splicing has been determined. The phCG6 gene is highly 
expressed in many types of solid tumors, including many non-germ line tumors, but the phCG6 
gene is silent in the majority cells in a normal adult. Therefore, the phCG6 pre-mRNA represents 
a desirable target for a trans-splicing reaction designed to produce tumor-specific toxicity. 
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[0076] The first exon of phCG6 pre-mRNA is ideal in that it encodes only five amino 

acids, including the initiator AUG, which should result in minimal interference with the proper 
folding of the DT-A toxin while providing the required signals for effective translation of the 
trans-spliced mRNA. The DT-A exon, which is designed to include a stop codon to prevent 
chimeric protein formation, will be engineered to trans-splice into the last exon of the phCG6 
gene. The last exon of the phCG6 gene provides the construct with the appropriate signals to 
polyadenylate the mRNA and ensure translation. 

[0077] Cystic fibrosis (CF) is one of the most common fatal genetic disease in humans. 

Based on both genetic and molecular analyses, the gene associated with cystic fibrosis has been 
isolated and its protein product deduced (Kerem, B.S. et al., 1989, Science 245:1073-1080; 
Riordan et al., 1989, Science 245: 1066-1 073 ;Rommans, et al., 1989, Science 245:1059-1065). 
The protein product of the CF associated gene is called the cystic fibrosis transmembrane 
conductance regulator (CFTR). In a specific embodiment of the invention, a trans-splicing 
reaction will be used to correct a genetic defect in the DNA sequence encoding the cystic 
fibrosis transmembrane regulator (CFTR) whereby the DNA sequence encoding the cystic 
fibrosis trans-membrane regulator protein is expressed and a functional chloride ion channel is 
produced in the airway epithelial cells of a patient. 

[0078] Population studies have indicated that the most common cystic fibrosis mutation 

is a deletion of the three nucleotides in exon 10 that encode phenylalanine at position 508 of the 
CFTR amino acid sequence. As indicated in Figure 15, a /raws-splicing reaction was capable of 
correcting the deletion at position 508 in the CFTR amino acid sequence. The PTM used for 

NY02:355526.1 33 



A31304-B-A-C 072874.0138 
PATENT 

correction of the genetic defect contained a CFTR BD intron 9 sequence, a spacer sequence, a 
branch point, a polyp yrimidine tract, a 3 1 splice site and a wild type CFTR BD exon 10 sequence 
(Figure 13). The successful correction of the mutated DNA encoding CFTR utilizing a trans- 
splicing reaction supports the general application of PTMs for correction of genetic defects. 
[0079] The methods and compositions of the invention may also be used to regulate gene 

expression in plants. For example, frcws-splicing may be used to place the expression of any 
engineered gene under the natural regulation of a chosen target plant gene, thereby regulating the 
expression of the engineered gene. 7ra«s-splicing may also be used to prevent the expression of 
engineered genes in non-host plants or to convert an endogenous gene product into a more 
desirable product. 

[0080] In a specific embodiment of the invention /ran-splicing may be used to regulate 

the expression of the insecticidal gene that produces Bt toxin {Bacillus thuringiensis). For 
example, the PTM may be designed to trans-splice into an injury response gene (pre-mRNA) 
that is expressed only after an insect bites the plant. Thus, all cells of the plant would carry the 
gene for Bt in the PTM, but the cells would only produce Bt when and where an insect injures 
the plant. The rest of the plant will make little or no Bt. A PTM could trans-splice the Bt gene 
into any chosen gene with a desired pattern of expression. Further, it should be possible to target 
a PTM so' that no Bt is produced in the edible portion of the plant. 

[0081] One advantage associated with the use of PTMs is that the PTM acquires the 

native gene control elements of the target gene, thus, reducing the time and effort that might 
otherwise be spent attempting to identify and reconstitute appropriate regulatory sequences 
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upstream of an engineered gene. Thus, expression of the PTM regulated gene should occur only 
in those plant cells containing the target pre-mRNA. By targeting a gene not expressed in the 
edible portion of the plant or in the pollen, *raws-splicing can alleviate opposition to genetically 
modified plants, as consumers would not be eating the proteins made from modified genes. The 
edible portion of such crops should test negative for genetically modified proteins. 
[0082] In addition, PTM can be targeted to a unique sequence of the host gene that is not 

present in other plants. Therefore, even if the gene (DNA) which encodes the PTM jumps to 
another species of plant, the PTM gene will not have an appropriate target for frans-splicing. 
Thus, /raws-splicing offers a "fail-safe" mode for prevention of gene "jumping" to other plant 
species: the PTM gene will be expressed only in the engineered host plant, which contains the 
appropriate target pre-mRNA. Expression in non-engineered plants would not be possible. 
[0083] Trans-splicing also provides a more efficient way to convert one gene product 

into another. For example, rraws-splicing ribozymes and chimeric oligos can be incorporated 
into corn genomes to modify the ratio of saturated to unsaturated oils. Trans-splicing can also be 
used to convert one gene product into another. 

[0084] Various delivery systems are known and can be used to transfer the compositions 

of the invention into cells, e.g. encapsulation in liposomes, microparticles, microcapsules, 
recombinant cells capable of expressing the composition, receptor-mediated endocytosis (see, 
e.g., Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432), construction of a nucleic acid as part of 
a retroviral or other vector, injection of DNA, electroporation, calcium phosphate mediated 
transfection, etc. 
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[0085] The compositions and methods can be used to treat cancer and other serious viral 

infections, autoimmune disorders, and other pathological conditions in which the alteration or 
elimination of a specific cell type would be beneficial. Additionally, the compositions and 
methods may also be used to provide a gene encoding a functional biologically active molecule 
to cells of an individual with an inherited genetic disorder where expression of the missing or 
mutant gene product produces a normal phenotype. 

[0086] In a preferred embodiment, nucleic acids comprising a sequence encoding a PTM 

are administered to promote PTM function, by way of gene delivery and expression into a host 
cell. In this embodiment of the invention, the nucleic acid mediates an effect by promoting 
PTM production. Any of the methods for gene delivery into a host cell available in the art can 
be used according to the present invention. For general reviews of the methods of gene delivery 
see Strauss, M. and Barranger, J. A., 1997, Concepts in Gene Therapy, by Walter de Gruyter & 
Co., Berlin; Goldspiel et aL, 1993, Clinical Pharmacy 12:488-505; Wu and Wu, 1991, 
Biotherapy 3:87-95; Tolstoshev, 1993, Ann. Rev. Pharmacol. Toxicol. 33:573-596; Mulligan, 
1993, Science 260:926-932; and Morgan and Anderson, 1993, Ann. Rev. Biochem. 62:191-217; 
1993, TIBTECH 1 1(5):155-215. Exemplary methods are described below. 
[0087] Delivery of the nucleic acid into a host cell may be either direct, in which case the 

host is directly exposed to the nucleic acid or nucleic acid-carrying vector, or indirect, in which 
case, host cells are first transformed with the nucleic acid in vitro, then transplanted into the host. 
These two approaches are known, respectively, as in vivo or ex vivo gene delivery. 
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[0088] In a specific embodiment, the nucleic acid is directly administered in vivo, where 

it is expressed to produce the PTM. This can be accomplished by any of numerous methods 
known in the art, e.g., by constructing it as part of an appropriate nucleic acid expression vector 
and administering it so that it becomes intracellular, e.g. by infection using a defective or 
attenuated retroviral or other viral vector (see U.S. Patent No. 4,980,286), or by direct injection 
of naked DNA, or by use of microparticle bombardment {e.g., a gene gun; Biolistic, Dupont), or 
coating with lipids or cell-surface receptors or transfecting agents, encapsulation in liposomes, 
microparticles, or microcapsules, or by administering it in linkage to a peptide which is known to 
enter the nucleus, by administering it in linkage to a ligand subject to receptor-mediated 
endocytosis (see e.g., Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432). 

[0089] In a specific embodiment, a viral vector that contains the PTM can be used. For 

example, a retroviral vector can be utilized that has been modified to delete retroviral sequences 
that are not necessary for packaging of the viral genome and integration into host cell DNA (see 
Miller et a/., 1993, Meth. Enzymol. 217:581-599). Alternatively, adenoviral or adeno-associated 
viral vectors can be used for gene delivery to cells or tissues. (See, Kozarsky and Wilson, 1993, 
Current Opinion in Genetics and Development 3:499-503 for a review of adeno virus-based gene 
delivery). 

[0090] Another approach to gene delivery into a cell involves transferring a gene to cells 

in tissue culture by such methods as electroporation, lipofection, calcium phosphate mediated 
transfection, or viral infection. Usually, the method of transfer includes the transfer of a 
selectable marker to the cells. The cells are then placed under selection to isolate those cells that 
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have taken up and are expressing the transferred gene. The resulting recombinant cells can be 
delivered to a host by various methods known in the art. In a preferred embodiment, the cell 
used for gene delivery is autologous to the host cell. 

[0091] The present invention also provides for pharmaceutical compositions comprising 

an effective amount of a PTM or a nucleic acid encoding a PTM, and a pharmaceutically 
acceptable carrier. In a specific embodiment, the term "pharmaceutically acceptable" means 
approved by a regulatory agency of the Federal or a state government or listed in the U.S. 
Pharmacopeia or other generally recognized pharmacopeia for use in animals, and more 
particularly in humans. The term "carrier" refers to a diluent, adjuvant, excipient, or vehicle with 
which the therapeutic is administered. Examples of suitable pharmaceutical carriers are 
described in "Remington's Pharmaceutical sciences" by E.W. Martin. 

[0092] In specific embodiments, pharmaceutical compositions are administered: (1) in 

diseases or disorders involving an absence or decreased (relative to normal or desired) level of an 
endogenous protein or function, for example, in hosts where the protein is lacking, genetically 
defective, biologically inactive or underactive, or under expressed; or (2) in diseases or disorders 
wherein, in vitro or in vivo, assays indicate the utility of PTMs that inhibit the function of a 
particular protein . The activity of the protein encoded for by the chimeric mRNA resulting 
from the PTM mediated trans-splicing reaction can be readily detected, e.g., by obtaining a host 
tissue sample (e.g., from biopsy tissue) and assaying it in vitro for mRNA or protein levels, 
structure and/or activity of the expressed chimeric mRNA. Many methods standard in the art can 
be thus employed, including but not limited to immunoassays to detect and/or visualize the 
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protein encoded for by the chimeric mRNA (e.g., Western blot, immunoprecipitation followed 
by sodium dodecyl sulfate polyacrylamide gel electrophoresis, immunocytochemistry, etc.) 
and/or hybridization assays to detect formation of chimeric mRNA expression by detecting 
and/or visualizing the presence of chimeric mRNA (e.g., Northern assays, dot blots, in situ 
hybridization, and Reverse-Transcription PCR, etc.), etc. 

[0093] The present invention also provides for pharmaceutical compositions comprising 

an effective amount of a PTM or a nucleic acid encoding a PTM, and a pharmaceutically 
acceptable carrier. In a specific embodiment, the term "pharmaceutically acceptable" means 
approved by a regulatory agency of the Federal or a state government or listed in the U.S. 
Pharmacopeia or other generally recognized pharmacopeia for use in animals, and more 
particularly in humans. The term "carrier" refers to a diluent, adjuvant, excipient, or vehicle with 
which the therapeutic is administered. Examples of suitable pharmaceutical carriers are 
described in "Remington's Pharmaceutical sciences" by E.W. Martin. In a specific embodiment, 
it may be desirable to administer the pharmaceutical compositions of the invention locally to the 
area in need of treatment. This may be achieved by, for example, and not by way of limitation, 
local infusion during surgery, topical application, e.g., in conjunction with a wound dressing 
after surgery, by injection, by means of a catheter, by means of a suppository, or by means of an 
implant, said implant being of a porous, non-porous, or gelatinous material, including 
membranes, such as sialastic membranes, or fibers. Other control release drug delivery systems, 
such as nanoparticles, matrices such as controlled-release polymers, hydrogels. 
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[0094] The PTM will be administered in amounts which are effective to produce the 

desired effect in the targeted cell. Effective dosages of the PTMs can be determined through 
procedures well known to those in the art which address such parameters as biological half-life, 
bioavailability and toxicity. The amount of the composition of the invention which will be 
effective will depend on the nature of the disease or disorder being treated, and can be 
determined by standard clinical techniques. In addition, in vitro assays may optionally be 
employed to help identify optimal dosage ranges. 

[0095] The present invention also provides a pharmaceutical pack or kit comprising one 

or more containers filled with one or more of the ingredients of the pharmaceutical compositions 
of the invention optionally associated with such container(s) can be a notice in the form 
prescribed by a governmental agency regulating the manufacture, use or sale of pharmaceuticals 
or biological products, which notice reflects approval by the agency of manufacture, use or sale 
for human administration. 

5.3.2. USE OF PTM MOLECULES FOR EXON TAGGING 
[0096] In view of current efforts to sequence and characterize the genomes of humans 

and other organisms, there is a need for methods that facilitate such characterization. A majority 
of the information currently obtained by genomic mapping and sequencing is derived from 
complementary DNA (cDNA) libraries, which are made by reverse transcription of mRNA into 
cDNA. Unfortunately, this process causes the loss of information concerning intron sequences 
and the location of exon/intron boundaries. 
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[0097] The present invention encompasses a method for mapping exon-intron boundaries 

in pre-mRNA molecules comprising (i) contacting a pre-trans-splicing molecule with a pre- 
mRNA molecule under conditions in which a portion of the pre-trans-splicing molecule is trans- 
spliced to a portion of the target pre-mRNA to form a chimeric mRNA; (ii) amplifying the 
chimeric mRNA molecule; (iii) selectively purifying the amplified molecule; and (iv) 
determining the nucleotide sequence of the amplified molecule thereby identifying the intron- 
exon boundaries. 

[0098] In an embodiment of the present invention, PTMs can be used in trans-splicing 

reactions to locate exon-intron boundaries in pre-mRNAs molecules. PTMs for use in mapping 
of intron-exon boundaries have structures similar to those described above in Section 5.1. 
Specifically, the PTMs contain (i) a target binding domain that is designed to bind to many pre- 
mRNAs: (ii) a 3' splice region that includes a branch point, pyrimidine tract and a 3 ! splice 
acceptor site, or a 5' splice donor site; (iii) a spacer region that separates the mRNA splice site 
from the target binding domain; and (iv) a tag region that will be trans-spliced onto a pre-mRNA. 
Alternatively, the PTMs to be used to locate exon-intron boundaries may be engineered to 
contain no target binding domain. 

[0099] For purposes of intron-exon mapping, the PTMs are genetically engineered to 

contain target binding domains comprising random nucleotide sequences. The random 
nucleotide sequences contain at least 15-30 and up to several hundred nucleotide sequences 
capable of binding and anchoring a pre-mRNA so that the spliceosome processing machinery of 
the nucleus can trans-splice a portion (tag or marker region) of the PTM to a portion of the pre- 
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mRNA. PTMs containing short target binding domains, or containing inosines bind under less 
stringent conditions to the pre-mRNA molecules. In addition, strong branch point sequences and 
pyrimidine tracts serve to increase the non-specificity of PTM trans-splicing. 
[00100] The random nucleotide sequences used as target binding domains in the PTM 

molecules can be generated using a variety of different methods, including, but not limited to, 
partial digestion of DNA with restriction endonucleases or mechanical shearing of the DNA. 
The use of such random nucleotide sequences is designed to generate a vast array of PTM 
molecules with different binding activities for each target pre-mRNA expressed in a cell. 
Randomized libraries of oligonucleotides can be synthesized with appropriate restriction 
endonucleases recognition sites on each end for cloning into PTM molecules genetically 
engineered into plasmid vectors. When the randomized oligonucleotides are litigated and 
expressed, a randomized binding library of PTMs is generated. 

[00101] In a specific embodiment of the invention, an expression library encoding PTM 

molecules containing target binding domains comprising random nucleotide sequences can be 
generated using a variety of methods which are well known to those of skill in the art. Ideally, 
the library is complex enough to contain PTM molecules capable of interacting with each target 
pre-mRNA expressed in a cell. 

[00102] By way of example, Figure 9 is a schematic representation of two forms of PTMs 

which can be utilized to map intron-exon boundaries. The PTM on the left is capable of non- 
specifically trans-splicing into a pre-mRNA 3 f splice site, while the PTM on the right is capable 
of trans-splicing into a pre-mRNA 5' splice site. Trans-splicing between the PTM and the target 
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pre-mRNA results in the production of a chimeric mRNA molecule having a specific nucleotide 
sequence "tag" on either the 3* or 5* end of an authentic exon. 

[00103] Following selective purification, a DNA sequencing reaction is then performed 

using a primer which begins in the tag nucleotide sequence of the PTM and proceeds into the 
sequence of the tagged exon. The sequence immediately following the last nucleotide of the tag 
nucleotide sequence represents an exon boundary. For identification of intron-exon tags, the 
trans-splicing reactions of the invention can be performed either in vitro or in vivo using methods 
well known to those of skill in the art. 

5.3.3. USE OF PTM MOLECULES FOR DDENTIFICATION 

OF PROTEINS EXPRESSED IN A CELL 

[00104] In yet another embodiment of the invention, PTM mediated trans-splicing 

reactions can be used to identify previously undetected and unknown proteins expressed in a cell. 
This method is especially useful for identification of proteins that cannot be detected by a two- 
dimensional electrophoresis, or by other methods, due to inter alia the small size of the protein, 
low concentration of the protein, or failure to detect the protein due to similar migration patterns 
with other proteins in two-dimensional electrophoresis. 

[00105] The present invention relates to a method for identifying proteins expressed in a 

cell comprising (i) contacting a pre-trans-splicing molecule containing a random target binding 
domain and a nucleotide sequence encoding a peptide tag with a pre-mRNA molecule under 
conditions in which a portion of the pre-trans-splicing molecule is trans-spliced to a portion of 
the target pre-mRNA to form a chimeric mRNA encoding a fusion polypeptide or separating it 
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by gel electrophoresis (ii) affinity purifying the fusion polypeptide; and (iii) determining the 
amino acid sequence of the fusion protein. 

[00106] To identify proteins expressed in a cell, the PTMs of the invention are genetically 

engineered to contain: (i) a target binding domain comprising randomized nucleotide sequences; 
(ii) a 3' splice region that includes a branch point, pyrimidine tract and a 3 ! splice acceptor site 
and/or a 5' splice donor site; (iii) a spacer region that separates the PTM splice site from the 
target binding domain; and (iv) nucleotide sequences encoding a marker or peptide affinity 
purification tag. Such peptide tags include, but are not limited to, HIS tags (6 histidine 
consecutive residues) (Janknecht, etal., 1991 Proc. Natl. Acad. Sci. USA 88:8972-8976), 
glutathione-S-transferase (GST) (Smith, D.B. and Johnson K.S., 1988, Gene 67:31) (Pharmacia) 
or FLAG (Kodak/IBI) tags (Nisson, J. et al. J. Mol. Recognit., 1996, 5:585-594). 
[00107] rra/zs-splicing reactions using such PTMs results in the generation of chimeric 

mRNA molecules encoding fusion proteins comprising protein sequences normally expressed in 
a cell linked to a marker or peptide affinity purification tag. The desired goal of such a method is 
that every protein synthesized in a cell receives a marker or peptide affinity tag thereby 
providing a method for identifying each protein expressed in a cell. 

[00108] In a specific embodiment of the invention, PTM expression libraries encoding 

PTMs having different target binding domains comprising random nucleotide sequences are 
generated. The desired goal is to create a PTM expression library that is complex enough to 
produce a PTM capable of binding to each pre-mRNA expressed in a cell. Li a preferred 
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embodiment, the library is cloned into a mammalian expression vector that results in one, or at 
most, a few vectors being present in any one cell. 

[00109] To identify the expression of chimeric proteins, host cells are transformed with 

the PTM library and plated so that individual colonies containing one PTM vector can be grown 
and purified. Single colonies are selected, isolated, and propagated in the appropriate media and 
the labeled chimeric protein exon(s) fragments are separated away from other cellular proteins 
using, for example, an affinity purification tag. For example, affinity chromatography can 
involve the use of antibodies that specifically bind to a peptide tag such as the FLAG tag. 
Alternatively, when utilizing HIS tags, the fusion proteins are purified using a Ni nitnloacetic 
acid agarose columns, which allows selective elution of bound peptide eluted with imidazole 
containing buffers. When using GST tags, the fusion proteins are purified using glutathione- S- 
transferase agarose beads. The fusion proteins can then be eluted in the presence of free 
glutathione. 

[00110] Following purification of the chimeric protein, an analysis is carried out to 

determine the amino acid sequence of the fusion protein. The amino acid sequence of the fusion 
protein is determined using techniques well known to those of skill in the art, such as Edman 
Degradation followed by amino acid analysis using HPLC, mass spectrometry or an amino acid 
analyzation. Once identified, the peptide sequence is compared to those sequences available in 
protein databases, such as GenBank. If the partial peptide sequence is already known, no further 
analysis is done. If the partial protein sequence is unknown, then a more complete sequence of 
that protein can be carried out to determine the full protein sequence. Since the fusion protein 
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will contain only a portion of the full length protein, a nucleic acid encoding the full length 
protein can be isolated using conventional methods. For example, based on the partial protein 
sequence oligonucleotide primers can be generated for use as probes or PCR primers to screen a 
cDNA library. 

6. EXAMPLE: PRODUCTION OF TRANS-SVLICJNG MOLECULES 
[00111] The following section describes the production of PTMs and the demonstration 

that such molecules are capable of mediating rraws-splicing reactions resulting in the production 
of chimeric mRNA molecules. 

6.1. MATERIALS AND METHODS 

6.1.1. CONSTRUCTION OF PRE-mRNA MOLECULES 
[00112] Plasmids containing the wild type diphtheria toxin subunit A (DT-A, wild-type 

accession #K01722) and a DT-A mutant (CRM 197, no enzymatic activity) were obtained from 
Dr. Virginia Johnson, Food and Drug Administration, Bethesda, Maryland (Uchida et al. 9 1973 J. 
Biol. Chem 248:3838). For in vitro experiments, DT-A was amplified using primers: DT-1F 
(S'-GGCGCTGCAGGGCGCTGATGATGTTGTTG) (SEQ ID NO:2); and DT-2R 
(S'-GGCGAAG CTTGGATCCGACACGATTTCCTGCACAGG) (SEQ ID NO:3), cut with PstI 
and Hindlll, and cloned into PstI and Hindlll digested pBS(-) vector (Stratagene, La Jolla, CA). 
The resulting clone, pDTA was used to construct the individual PTMs. (l)pPTM+: Targeted 
construct. Created by inserting IN3-1 (5 ' AATTCTCT AGATGCTT 

CACCCGGGCCTGACTCGAGTACTAACTGGTACCTCTTCTTTTTTTTCCTGCA) (SEQ ID 
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NO:4) and IN2-4 (5'-GGAAAAAAAAGAAGAGGTACCAGTTAGTACTCGAGTCAGG 
CCCGGGTGAAGCATCTAGAG) (SEQ ID NO:5) primers into EcoRI and Pstl digested pDTA. 
(2) pPTM+Sp: As pPTM+ but with a 30 bp spacer sequence between the BD and BP. Created 
by digesting pPTM+ with Xhol and ligating in the oligonucleotides, spacer S 
(5'-TCGAGCAACGTTATAATAATGTTC) (SEQ ID NO:6) and spacer AS 
(5 '-TCG AGAAC ATT ATT ATAACGTTGC) (SEQ ID NO:7). For in vivo studies, an EcoRI and 
Hindlll fragment of pcPTM+Sp was cloned into mammalian expression vector pcDNA3.1 
(Invitrogen), under the control of a CMV promoter. Also, the methionine at codon 14 was 
changed into isoleucine to prevent initiation of translation. The resulting plasmid was 
designated as pcPTM+Sp. (3) pPTM+CRM: As pPTM+Sp but the wild type DT-A was 
substituted with CRM mutant DT-A (T. Uchida, et al., 1973, J. Biol. Chem. 248:3838). This was 
created by PCR amplification of a DT-A mutant (mutation at G52E) using primers DT-1F and 
DT-2R. For in vivo studies, an EcoRI Hindlll fragment of PTM+CRM was cloned into 
pc3.1DNA that resulted in pcPTM+ARM. (4) PTM-: Non-targeted construct. Created by 
digestion of PTM+ with EcoRI and Pst I, gel purified to remove the binding domain followed by 
ligation of the oligonucleotides, IN-5 (5'-ATCTCTAGATCAGGCCCGGGTGAAGCC CGAG) 
(SEQ ID NO:8) and IN-6 (5'-TGCTTCACCC GGGCCTGATCTAGAG) (SEQ ID NO:9). (5) 
PTM-Sp, is an identical version of the PTM-, except it has a 30 bp spacer sequence at the Pstl 
site. Similarly, the splice mutants [Py(-)AG(-) and BP(-)Py(-)AG(-)] and safety variants 
[PTM+SF-Pyl, PTM+SF-Py2, PTM+SFBP3 and PTM+SFBP3-Pyl] were constructed either by 
insertion or deletion of specific sequences (see Table 1). 
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Table 1. Binding/non-binding domain, BP, PPT and 3' as sequences of different PTMs. 


PTM construct 


BD/NBD 


BP 


PPT 


3'ss 


PTM+Sp (targeted) 


: TGCTTC ACCCjG GGCCTG A 
(SEQ ID NO: 10) 


TACTAAC 


CTCTTCTTTTTTTTCC 
(SEQ ID NO: 11) 


CAG 


PTM-Sp (non-targeted) 


: C A ACGTTATA ATA ATGTT 
(SEQ ID NO: 12) 


TACTAAC 


CTCTTCTTTTTTTTCC 
(SEQ ID NO: 11) 


CAG 


PTM+Py (-)AG(-)BP(-) 


: TGCTTC ACCCGG GCCTG A 
(SEQ ID NO: 10) 


GGCTGAT 


CTGTGATTAATAGCGG 
(SEQ ID NO: 13) 


ACG 


PTM+Py(-)AG(-) 


:TGCTTCACCCG GGCCTG A 
(SEQ ID NO: 10) 


TACTAAC 


CCTGGACGCGGAAGT 
T 

(SEQ ID NO: 14) 


ACG 


PTM+SF 


: CTGGG AC A AGG AC ACTGCTT 
CACCCGGTTAGTAGACCACA 
GCCCTGAAGCC 
(SEQ ID NO: 15) 


TACTAAC 


CTTCTGTTTTTTTCTC 
(SEQ ID NO: 16) 


CAG 


PTM+SF-Pyl 


:As in PTM+SF 


TACTAAC 


CTTCTGTATTATTCTC 
(SEQ ID NO: 17) 


CAG 


PTM+SF-Py2 


: As in PTM+SF 


TACTAAC 


GTTCTGTCCTTGTCTC 
(SEQ ID NO: 18) 


CAG 


PTM+SF-BP3 


:As in PTM+SF 


TGCTGAC 


CTTCTGTTTTTTTCTC 
(SEQ ID NO: 16) 


CAG 


PTM+SFBP3-Pyl 


:As in PTM+SF 


TGCTGAC 


CTTCTGTATTATTCTC 
(SEQ ID NO: 17) 


CAG 



Nucleotides in bold indicate the mutations compared to normal BP, PPT and 3 ! splice site. 
Branch site A is underlined. The nucleotides in italics indicates the mismatch introduced into 
safety BD to mask the BP sequence in the PTM. 



[00113] A double-trans-splicing PTM construct (DS-PTM) was also made adding a 

5* splice site and a second target binding domain complementary to the second intron of pHCG 
pre-mRNA to the 3' end of the toxin coding sequence of PTM+SF (Figure A). 
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6.1.2. pHCG6 TARGET PRE-mRNA 
[00114] To produce the in vitro target pre-mRNA, a Sad fragment of PHCG gene 6 

(accession #X00266) was cloned into pBS(-). This produced an 805 bp insert from nucleotide 
460 to 1265, which includes the 5' untranslated region, initiation codon, exon 1, intron 1, exon 2, 
and most of intron 2. For in vivo studies, an EcoRI and BamHI fragment was cloned into 
mammalian expression vector (pc3.1DNA), producing (5HCG6. 

6.1.3. mRNA PREPARATION 
[00115] For in vitro splicing experiments, pHCG6, p-globin pre-mRNA and different 

PTM mRNAs were synthesized by in vitro transcription of BamHI and Hindlll digested plasmid 
DNAs respectively, using T7 mRNA polymerase (Pasman & Garcia-Blanco, 1996, Nucleic 
Acids Res. 24:1638). Synthesized mRNAs were purified by electrophoresis on a denaturing 
polyacrylamide gel, and the products were excised and eluted. 

6.1.4 IN VITRO SPLICING 
[00116] PTMs and target pre-mRNA were annealed by heating at 98 °C followed by slow 

cooling to 30-34°C. Each reaction contained 4 jal of annealed mRNA complex (100 ng of target 

and 200 ng of PTM), IX splice buffer (2 mM MgCl 2 , 1 mM ATP, 5 mM creatinine phosphate, 

and 40 mM KCI) and 4 jal of HeLa splice nuclear extract (Promega) in a 12.5 (il final volume. 

Reactions were incubated at 30°C for the indicated times and stopped by the addition of an equal 

volume of high salt buffer (7 M urea, 5% SDS, 100 mM LiCl, 10 mM EDTA and 10 mM Tris- 
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HCI, pH 7.5). Nucleic acids were purified by extraction with phenol:chloroform:isoamyl alcohol 
(50:49:1) followed by ethanol precipitation. 

6.1.5. REVERSE TRANSCRIPTION-PCR REACTIONS 
[00117] RT-PCR analysis was performed using EZ-RT PCR kit (Perkin-Elmer, Foster 

City, CA). Each reaction contained 10 ng of cis- or trans-spliced mRNA, or 1-2 p.g of total 
mRNA, 0.1 ul of each 3' and 5* specific primer, 0.3 mM of each dNTP, IX EZ buffer (50 mM 
bicine, 115 mM potassium acetate, 4% glycerol, pH 8.2), 2.5 mM magnesium acetate and 5 U of 
xTth DNA polymerase in a 50 ul reaction volume. Reverse transcription was performed at 60°C 
for 45 min followed by PCR amplification of the resulting cDNA as follows: one cycle of initial 
denaturation at 94°C for 30 sec, and 25 cycles of denaturation at 94° C for 18 sec and annealing 
and extension at 60°C for 40 sec, followed by a 7 min final extension at 70°C. Reaction products 
were separated by electrophoresis in agarose gels. 
[00118] Primers used in the study were as follows: 

DT-1F: GGCGCTGCAGGGCGCTGATGATGTTGTTG (SEQIDNO: 19) 
DT-2R: GGCGAAGCTTGGATCCGACACGATTTCCTGCACAGG (SEQ ID NO: 20) 
DT-3R: CATCGTCATAATTTCCTTGTG (SEQ ID NO: 21) 
DT-4R: ATGGAATCTACATAACCAGG (SEQ ID NO: 22) 
DT-5R: GAAGGCTGAGCACTACACGC (SEQ ID NO: 23) 
HCG-R2: CGGCACCGTGGCCGAAGTGG (SEQ ID NO: 24) 
Bio-HCG-F: ACCGGAATTCATGAAGCCAGGTACACCAGG (SEQ ID NO: 25) 

p- globulin-F: GGGCAAGGTGAACGTGGATG (SEQ ID NO: 26) 
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p- globulin-R: ATCAGGAGTGGACAGATCC (SEQ ID NO: 27) 

6.1.6. CELL GROWTH, TRANSFECTION AND mRNA ISOLATION 
[00119] Human lung cancer cell line H1299 (ATCC accession # CRL-5803) was grown in 

RPMI medium supplemented with 10% fetal bovine serum at 37°C in a 5% C0 2 environment. 
Cells were transfected with pcSp+CRM (CRM is a non-functional toxin), a vector expressing a 
PTM, or vector alone (pcDNA3.1) using lipofectamine reagent (Life Technologies, 
Gaithersburg, MD). The assay was scored for neomycin resistance (neo r ) colony formation two 
weeks after transfection. Four neo r colonies were selected and expanded under continued neo 
selection. Total cellular mRNA was isolated using RNA exol (BioChain Institute, Inc., San 
Leandro, CA) and used for RT-PCR. 

6.1.7. m4Att-SPLICING IN TUMORS IN NUDE MICE 
[00120] Eleven nude mice were bilaterally injected (except B10, Bll and B12 had 

1 tumor) into the dorsal flank subcutaneous space with 1 x 10 7 HI 299 human lung tumor cells 
(day 1). On day 14, the mice were given an appropriate dose of anesthesia and injected with, or 
without electroporation (T820, BTX Inc., San Diego, CA) in several orientations with a total 
volume of 100 fA of saline containing 100 jug pcSp+CRM with or without pc(3HCG6 or 
pcPTM+Sp. Solutions injected into the right side tumors also contained India ink to mark needle 
tracks. The animals were sacrificed 48 hours later and the tumor excised and immediately frozen 
at -80°C. For analysis, 10 mg of each tumor was homogenized and mRNA was isolated using a 
Dynabeads mRNA direct kit (Dynal) following the manufacturers directions. Purified mRNA 
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(2 Ad of 10 ,ul total volume) was subjected to RT-PCR using PHCG-F and DT-5R primers as 
described earlier. All samples were re-amplified using DT-3R, a nested DT-A primer and 
biotinylated PHCG-F and the products were analyzed by electrophoresis on a 2% agarose gel. 
Samples that produced a band were processed into single stranded DNA using M280 
Streptavidin Dynabeads and sequenced using a toxin specific primer (DT-3R). 

6.2. RESULTS 

6.2.1. SYNTHESIS OF PTM 
[00121] A prototypical trans-splicing mRNA molecule, pcPTM+Sp (Figure 1A) was 

constructed that included: an 18 nt target binding domain (complementary to pHCG6 intron 1), a 
30 nucleotide spacer region, branch point (BP) sequence, a polypyrimidine tract (PPT) and an 
AG dinucleotide at the 3' splice site immediately upstream of an exon encoding diphtheria toxin 
subunit A (DT-A) (Uchida et aL, 1973, J. Biol. Chem. 248:3838). Later DT-A exons were 
modified to eliminate translation initiation sites at codon 14. The PTM constructs were designed 
for maximal activity in order to demonstrate /ra/w-splicing; therefore, they included potent 3 ! 
splice elements (yeast BP and a mammalian PPT) (Moore et aL, 1993, In The mRNA World, 
R.F. Gesteland and J.F. Atkins, eds. (Cold Spring Harbor, New York: Cold Spring Harbor 
Laboratory Press). PHCG6 pre-mRNA (Talmadge et aL, 1984, Nucleic Acids Res. 12:8415) was 
chosen as a model target as this gene is expressed in most tumor cells. It is not expressed in 
normal adult cells, with the exception of some in the pituitary gland and gonads. (Acevedo et aL, 
1992, Cancer 76:1467; Hoon et aL, 1996, Int J. Cancer 69:369; Bellet et aL, 1997, Cancer Res. 
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57:516). As shown in Figure 1C> pcPTM+Sp forms conventional Watson-Crick base pairs by its 
binding domain with the 3' end of pHCG6 intron 1, masking the intronic 3' splice signals of the 
target. This feature is designed to facilitate trans-splicing between the target and the PTM. 
[00122] HeLa nuclear extracts were used in conjunction with established splicing 

procedures (Pasman & Garcia-Blanco, 1996, Nucleic Acids Res. 24:1638) to test if a PTM 
construct could invade the PHCG6 pre-mRNA target. The products of in vitro /raws-splicing 
were detected by RT-PCR, using primers specific for chimeric mRNA molecules. The predicted 
product of a successful /raws-splicing reaction is a chimeric mRNA comprising the first exon of 
(3HCG6, followed immediately by the exon contributed from pcPTM+Sp encoding DT-A (Figure 
1C). Such chimeric mRNAs were readily detected by RT-PCR using primers pHCG-F (specific 
to pHCG6 exon 1) and DT-3R (specific to DT-A, Figure 2 A, lanes 1-2). At time zero or in the 
absence of ATP, no 466 bp product was observed, indicating that this reaction was both ATP and 
time dependent. 

[00123] The target binding domain of pcPTM+Sp contained 18 nucleotides 

complementary to PHCG6 intron 1 pre-mRNA and demonstrated efficient trans-splicing (Figure 
2 A, lanes 1-2). 7Va/is-splicing efficiency decreased at least 8 fold (Figure 2, lanes 3-4) using 
non-targeted PTM-Sp, which contains a non-complementary 18 nucleotide "non-binding 
domain". 7Va«s-splicing efficiencies of PTM mRNAs with or without a spacer between the 
binding domain and BP were also compared. This experiment demonstrated a significant 
increase in the efficiency of /raws-splicing by the addition of a spacer (Figure 2B, lanes 2 + 5). 
To facilitate the recruitment of splicing factors required for efficient /raws-splicing, some space 
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may be needed between the 3* splice site and the double- stranded secondary structure produced 
by the binding domain/target interaction. 

[00124] To investigate the effect of PTM length on trans-splicing specificity, shorter 

PTMs were synthesized from AccI cut PTM plasmid (see Figure 1). This eliminated 479 nt from 
the 3' end of the DT-A coding sequence. Figure 2B shows the trans-splicing ability of a targeted 
short PTM(+) (lanes 10-12), compared to a non-targeted short PTM(-) (lanes 14-17). Short 
PTM+ produced substantially more /raw-spliced product (Figure 2B, lane 12) than its 
counterpart, non-targeted short PTM (Figure 2B, lane 17). These experiments indicate that 
longer PTMs may have increased potential to mediate rra/ts-splicing non-specifically. 

6.2.2. ACCURACY OF PTM SPLICEOSOME 

MEDIATED TRANSDUCING 

[00125] To confirm that /raws-splicing between the pcPTM+Sp and pHCG6 target is 

precise, RT-PCR amplified product was produced using 5' biotinylated (5HCG-F and non- 

biotinylated DT-3R primers. This product was converted into single stranded DNA and 

sequenced directly with primer DT-3R (DT-A specific reverse primer) using the method of 

Mitchell and Merril (1989, Anal. Biochem. 178:239). 7ra«s-splicing occurred exactly between 

the predicted splice sites (Figure 3), confirming that a conventional pre-mRNA can be invaded 

by an engineered PTM construct during splicing; moreover, this reaction is precise. 

[00126] In addition selective trans-splicing of a double splicing PTM (DS-PTM) was 

observed (Figure 8B). The DS-PTM can produce /ra/ts-splicing by contributing either a 3' or 5' 

splice site. Further, DS-PTMs can be constructed which will be capable of simultaneously 
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double-*ra«s-splicing, at both a 3 f and 5' site, thereby permitting exon replacement. Figure 8B 
demonstrates that in this construct the 5' splice site is most active at a 1:1 concentration of target 
PHCG pre-mRNA:DS-PTM. At a 1 :6 ratio the 3' splice site is more active. 

6.2.3. 3 ! SPLICE SITES ARE ESSENTIAL FOR PTM TRANS-SPUONG 
[00127] In general, the 3 f splice site contains three elements: 1) a BP sequence located 5 1 

of the acceptor site, 2) a PPT consisting of a short run of pyrimidine residues, and 3) a YAG 
trinucleotide splice site acceptor at the intron-exon border (Senapathy et aL, 1990, Cell 91:875; 
Moore et a/., 1993). Deletion or alteration of one of these sequence elements are known to either 
decrease or abolish splicing (Aebi et al 9 1986; Reed & Maniatis 1988, Genes Dev. 2:1268; Reed, 
1989, Genes Dev. 3:2113; Roscigno et aL, 1993, J. Biol. Chem. 268:11222; Coolidge et al. 9 
1997, Nucleic Acids Res. 25:888). The role of these conserved elements in targeted trans- 
splicing was addressed experimentally. In one case [(BP(-)Py(-)AG(-)], all three cis elements 
(BP, PPT and AG dinucleotide) were replaced by random sequences. A second splicing mutant 
[(Py(-)AG(-)] was constructed in which the PPT and the 3' splice site acceptor were mutated and 
substituted by random sequences. Neither construct was able to support frans-splicing in vitro 
(Figure 2A, lanes 5-8), suggesting that, as in the case of conventional cw-splicing, the PTM 
frara-splicing process also requires a functional BP, PPT and AG acceptor at the 3' splice site. 

6.2.4. DEVELOPMENT OF A "SAFETY" SPLICE SITE 
TO INCREASE SPECIFICITY 

[00128] To improve the levels of target specificity achieved by the inclusion of a binding 

domain or by shortening the PTM, the target-binding domain of several PTM constructs was 
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modified to create an intra-molecular stem to mask the 3' splice site (termed a "safety PTM"). 
The safety stem is formed by portions of the binding domain that partially base pair with regions 
of the PTM 3* splice site or sequences adjacent to them, thereby blocking the access of 
spliceosomal components to the PTM 3' splice site prior to target acquisition (Figure 4A, 
PTM+SF). Base pairing between free portions of the PTM binding domain and pHCG6 target 
region unwinds the safety stem, allowing splicing factors such as U2AF to bind to the PTM 3' 

splice site and initiate trans-splicing (Figure 4B). 

r 

[00129] This concept was tested in splicing reactions containing either PTM+SF (safety) 

or pcPTM+Sp (linear), and both target (PHCG6) and non-target (P-globin) pre-mRNA. The 
spliced products were subsequently analyzed by RT-PCR and gel electrophoresis. Using PHCG- 
F and DT-3R primers, the specific 196 bp trans-spliced band was demonstrated in reactions 
containing PHCG target and either linear PTM (pcPTM+Sp, Figure 5, lane 2) or safety PTM 
(PTM+SF, Figure 5, lane 8). Comparison of the targeted trans-splicing between linear PTM 
(Figure 5, lane 2) and safety PTM (Figure 5, lane 8) demonstrated that the safety PTM trans- 
spliced less efficiently than the linear PTM. 

[00130] Non-targeted reactions were amplified using p-globin-F (specific to exon 1 of p- 

globin) and DT-3R primers. The predicted product generated by non-specific PTM trans- 
splicing with p-globin pre-mRNA is 189 bp. Non-specific /rans-splicing was evident between 
linear PTM and p-globin pre-mRNA (Figure 5, lane 5). In contrast, non-specific trans-splicing 
was virtually eliminated by the use of safety PTM (Figure 5, lane 1 1). This was not unexpected, 
since the linear PTM was designed for maximal activity to prove the concept of spliceosome- 
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mediated trans-splicing. The open structure of the linear PTM combined with its potent 3' splice 
sites strongly promotes the binding of splicing factors. Once bound, these splicing factors can 
potentially initiate /raws-splicing with any 5 f splice site, in a process similar to trans-splicing in 
trypanosomes. The safety stem was designed to prevent splicing factors, such as U2AF from 
binding to the PTM prior to target acquisition. This result is consistent with a model that base- 
pairing between the free portion of the binding domain and the pHCG6 target unwinds the safety 
stem (by mRNA-mRNA interaction), uncovering the 3' splice site, permitting the recruitment of 
splicing factors and initiation of trans-splicing. No trans-splicing was detected between 0- 
globin and pHCG6 pre-mRNAs (Figure 5, lanes 3, 6, 9 and 12). 

6.2.5. IN VITRO TRANS-S?UCJNG OF SAFETY PTM AND VARIANTS 
[00131] To better understand the role of czs-elements at the 3' splice site in trans-splicing 

a series of safety PTM variants were constructed in which either the PPT was weakened by 
substitution with purines and/or the BP was modified by base substitution (see Table I). In vitro 
trans-splicing efficiency of the safety (PTM+SF) was compared to three safety variants, which 
demonstrated a decreased ability to trans-splice. The greatest effect was observed with variant 2 
(PTM+SFPy2), which was fra^-splicing incompetent (Figure 4C, lanes 5-6). This inhibition of 
*ra«s-splicing may be attributed to a weakened PPT and/or the higher T m of the safety stem. In 
contrast, variations in the BP sequence (PTM+SFBP3) did not markedly effect trans-splicing 
(Figure 4C, lanes 7-8). This was not surprising since the modifications introduced were within 
the mammalian branch point consensus range YNYURAC (where Y = pyrimidine, R = purine 
and N = any nucleotide) (Moore et al. y 1993). This finding indicates that the branch point 
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sequence can be removed without affecting splicing efficiency. Alterations in the PPT 
(PTM+SF-Pyl) decreased the level of /raws-splicing (lanes 3-4). Similarly, when both BP and 
PPT were altered PTM+SFBP3-Pyl, they caused a further reduction in trans-splicing (Figure 4C, 
lanes 9-10). The order of trans-splicing efficiency of these safety variants is 
PTM+SF>PTM+SFBP3> PTM+SFPyl>PTM+SFBP3-Pyl>PTM+SFPy2. These results confirm 
that both the PPT and BP are important for efficient in vitro trans-splicing (Roscigno et aL, 
1993, J. Biol. Chem. 268:11222). 

6.2.6. COMPETITION BETWEEN CIS- AND TRANS- SPLICING 
[00132] To determine if it was possible to block pre-mRNA ds-splicing by increasing 

concentrations of PTM, experiments were performed to drive the reaction towards rraws-splicing. 
Splicing reactions were conducted with a constant amount of (5HCG6 pre-mRNA target and 
various concentrations of trans-splicing PTM. Cw-splicing was monitored by RT-PCR using 
primers to PHCG-F (exon 1) and (5HCG-R2 (exon 2). This amplified the expected 125 bp cis- 
spliced and 478 bp unspliced products (Figure 6A). The primers pHCG-F and DT-3R were used 
to detect trans-spliced products (Figure 6B). At lower concentrations of PTM, c/s-splicing (Fig. 
6A, lanes 1-4) predominated over trans-splicing (Figure 6B S lanes 1-4). C/5-splicing was 
reduced approximately by 50% at a PTM concentration 1.5 fold greater than target. Increasing 
the PTM mRNA concentration to 3 fold that of target inhibited czs-splicing by more than 90% 
(Figure 6A, lanes 7-9), with a concomitant increase in the fraws'-spliced product (Figure 6B, 
lanes 6-10). A competitive RT-PCR was performed to simultaneously amplify both cis and 

trans-spliced products by including all three primers ((5HCG-F, HCG-R2 and DT-3R) in a single 
NY02:355526.1 58 



A31304-B-A-C 072874.0138 
PATENT 

reaction. This experiment had similar results to those seen in Figure 6, demonstrating that under 
in vitro conditions, a PTM can effectively block target pre-mRNA c/s-splicing and replace it 
with the production of an engineered ^raws-spliced chimeric mRNA. 

6.2.7. TRANS-SPLICING IN TISSUE CULTURE 

[00133] To demonstrate the mechanism of trans-splicing in a cell culture model, the 

human lung cancer line HI 299 (PHCG6 positive) was transfected with a vector expressing 
SP+CRM (a non- functional diphtheria toxin) or vector alone (pcDNA3.1) and grown in the 
presence of neomycin. Four neomycin resistant colonies were individually collected after 14 
days and expanded in the continued presence of neomycin. Total mRNA was isolated from each 
clone and analyzed by RT-PCR using primers pHCG-F and DT-3R. This yielded the predicted 
196 bp rraws-spliced product in three out of the four selected clones (Figure 7 A, lanes 2, 3 and 
4). The amplified product from clone #2 was directly sequenced, confirming that PTM driven 
fraws-splicing occurred in human cells exactly at the predicted splice sites of endogenously 
expressed pHCG6 target exon 1 and the first nucleotide of DT-A (Figure 7B). 

6.2.8. TRANS-SPLICING IN AN IN VIVO MODEL 

[00134] To demonstrate the mechanism of /raws-splicing in vivo, the following experiment 

was conducted in athymic (nude) mice. Tumors were established by injecting 10 7 HI 299 cells 
into the dorsal flank subcutaneous space. On day 14, PTM expression plasmids were injected 
into tumors. Most tumors were then subjected to electroporation to facilitate plasmid delivery 
(see Table 2, below). After 48 hrs, tumors were removed, poly-A mRNA was isolated and 
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amplified by RT-PCR. 7ra«s-splicing was detected in 8 out of 19 PTM treated tumors. Two 
samples produced the predicted trans-spliced product (466 bp) from mRNA after one round of 
RT-PCR. Six additional tumors were subsequently positive for trans-splicing by a second PCR 
amplification using a nested set of primers that produced the predicted 196 bp product (Table 2). 
Each positive sample was sequenced, demonstrating that pHCG6 exon 1 was precisely trans- 
spliced to the coding sequence of DT-A (wild type or CRM mutant) at the predicted splice sites. 
Six of the positive samples were from treatment groups that received cotransfected plasmids, 
pcPTM+CRM and pcHCG6, which increased the concentration of target pre-mRNA. This was 
done to enhance the probability of detecting trans-spliced events. The other two positive tumors 
were from a group that received only pcPTM+Sp (wild type DT-A). These tumors were not 
transfected with pHCG6 expression plasmid, demonstrating once again, as in the tissue culture 
model described in Section 6.2.7, that /raws-splicing occurred between the PTM and endogenous 
(3HCG6 pre-mRNA produced by tumor cells. 
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Table 2. 7raws-splicing in tumors in nude mice. 


Mouse 


Plasmid 


Left Right 


Electroporation 


RT-PCR 
Left Rieht 


Nested PCR 


Nucleotide Sequence 


'Bl 


pCMV-Sport 


Bl-1 Bl-2 


- 




- 


- 


B2 


pCMV-Sport 


Bl-3 Bl-4 


a 1000V/cm 




- 


- 




pCop+UKJVI 




i uuu v/cm 












B3-3 B3-4 


a 1000V/cm 


- 


- 


- 


B4 


pcSp+CRM 


B4-1 B4-2 


b 50V/cm 


- 


- 


- 






B4-3 B4-4 


c 25V/cm 








B5 


pcSp+CRM/ 
pcHCG6 


B5-1 B5-2 


iUUUV/cm 


+ 


i _i_ 

T T 


a TmTpp a n i nnnnTn a th a t 

A 1 Lf 1 1 ttAU 1 VjvJv^O 1 vJ/A 1 0/\ 1 

(SEQ ID NO:65) 






DC 1 DC ^ 
DJ-J DJ-H 






+ + 


ATGTTCC AG 1 GGCGTG ATGAT 
(SEQ ID NO:65) 


B6 


pcSp+CRM/ 
pcHCG6 


B6-1 B6-2 


b 50V/cm 












B6-3 B6-4 


c 25V/cm 




+ + 


ATGTTCCAG 1 GGCGTG ATGAT 
(SEQ ID NO:65) 


B7 


pc PTM+Sp 


B7-1 


a 1000V/cm 








B8 


pc PTM+Sp 


B8-1 


b 50V/cm 




+ 


ATGTTCCAG 1 GGCGTG ATGAT 
(SEQ IDNO:65) 


l B9 


pc PTM+Sp 


B9-1 






+ 


ATGTTCCAG 1 GGCGTG ATGAT 
(SEQ ID NO:65) 



a : 6 pulses of 99 /us sets of 3 pulses administered orthogonally 
b : 8 pulses of 10ms sets of 4 pulses administered orthogonally 
°: 8 pulses of 50ms sets of 4 pulses administered orthogonally 
+ : positive for RT-PCR trans-spliced produce 



: did not receive electroporation 

7. EXAMPLE: lacZ TRANS-SPLICING MODEL 
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[00135] In order to demonstrate and evaluate the generality of the mechanism of 

spliceosome mediated targeted trans-splicing between a specific pre-mRNA target and a PTM, a 
simple model system based on expression of enzyme (3-galactosidase was developed. The 
following section describes results demonstrating successful splicesome mediated targeted trans- 
splicing between a specific target and a PTM. 

7.1. MATERIALS AND METHODS 

7.1.1. PRIMER SEQUENCES 

[00136] The following primers were used for testing the lacZ model system: 

5' Lac-IF GCATGAATTCGGTACCATGGGGGGGTTCTCATCATCATC (SEQ ID NO: 
28) 

5* Lac-IR CTGAGGATCCTCTTACCTGTAAACGCCCATACTGAC (SEQ ID NO: 29) 

3' Lac- 1 F GCATGGTAACCCTGCAGGGCGGCTTCGTCTGGGACTGG (SEQ ID NO: 
30) 

3' Lac-IR CTGAAAGCTTGTTAACTTATTATTTTTGACACCAGACC (SEQ ID NO: 3 1) 

3' Lac-Stop GCATGGTAACCCTGCAGGGCGGCTTCGTCTAATAATGGGACTGGGTG 
(SEQ ID NO: 32) 

HCG-InlF GCATGGATCCTCCGGAGGGCCCCTGGGCACCTTCCAC (SEQ ID NO: 33) 

HCG-InlR CTGACTGCAGGGTAACCGGACAAGGACACTGCTTCACC (SEQ ID NO: 
34) 

HCG-Ex2F GCATGGTAACCCTGCAGGGGCTGCTGCTGTTGCTG (SEQ ID NO: 35) 
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HCG-Ex2R CTGAAAGCTTGTTAACCAGCTCACCATGGTGGGGCAG (SEQ ID NO: 36) 
Lac-TRl (Biotin): 7-GGCTTTCGCTACCTGGAGAGAC (SEQ ID NO: 37) 
Lac-TR2 GCTGGATGCGGCGTGCGGTCG (SEQ ID NO: 38) 
HCG-R2: CGGCACCGTGGCCGAAGTGG (SEQ ID NO: 39) 

7. 1.2. CONSTRUCTION OF THE lacZ PRE-mRNA TARGET MOLECULE 
[00137] The lacZ target 1 pre-mRNA (pc3.1 lacTl) was constructed by cloning of the 

following three PCR products: (i) the 5' fragment of lacZ; followed by (ii) pHCG6 intron 1; (iii) 
and the 3' fragment of lacZ. The 5' and 3' fragment of the lacZ gene were PCR amplified from 
template pcDNA3.1/His/lacZ (Invitrogen,San Diego, CA) using the following primers: 5' Lac- 
1F and 5'Lac-lR (for 5' fragment), and 3'Lac-lF and 3' Lac-IR (for 3' fragment). The amplified 
lacZ 5' fragment is 1788 bp long which includes the initiation codon, and the amplified 3' 
fragment is 1385 bp long and has the natural 5' and 3' splice sites in addition to a branch point, 
polypyrimidine tract and (3HCG6 intron 1. The PHCG6 intron 1 was PCR amplified using the 
following primers: HCG-InlF andHCG-InlR. 

[00138] The lacZ target 2 is an identical version of lacZ target 1 except it contains two 

stop codons (TAA TAA) in frame four codons after the 3 'splice site. This was created by PCR 
amplification of the 3' fragment (lacZ) using the following primers: 3' Lac-Stop and 3' Lac 1R 
and replacing the functional 3' fragment in lacZ target 1. 



NY02:355526.1 



63 



A31304-B-A-C 072874.0138 
PATENT 

7.1.3. CONSTRUCTION OF pc3.1 PTM1 and pc3.1 PTM2 
[00139] The pre-trans-splicing molecule, pc3.1 PTM1 was created by digesting pPTM 

+Sp with PstI and Hindlll and replacing the DNA fragment encoding the DT-A toxin with the a 
DNA fragment encoding the functional 3' end of lacZ. This fragment was generated by PCR 
amplification using the following primers: 3' Lac-IF and 3' Lac-IR. For cell culture 
experiments, an EcoRI and Hindlll fragment of pc3.1 PTM2 which contains the binding domain 
to HCG intron 1, a 30 bp spacer, a yeast branch point (TACTAAC), and strong polypyrimidine 
tract followed by the lacZ cloned was cloned into pcDNA3.1. 

[00140] The pre-trans-splicing molecule, pc3.1 PTM2 was created by digesting pPTM 

+Sp with PstI and Hindlll and replacing the DNA fragment encoding the DT-A toxin with the 
pHCG6 exon 2. (5HCG6 exon 2 was generated by PCR amplification using the following 
primers: HCG-Ex2F and HCG-Ex2R. For cell culture experiments, an EcoRI and Hindlll 
fragment of pc3.1 PTM2 which contains the binding domain to HCG intron 1, a 30 bp spacer, a 
yeast branch point (TACTAAC), and strong polypyrimidine tract followed by the [3HCG6 exon 
2 cloned was used. 

7. 1 .4. CO-TRANSFECTION OF THE lacZ SPLICE TARGET 

PRE-mRNA AND PTMS INTO 293T CELLS 

[00141] Human embryonic kidney cells (293T) were grown in DMEM medium 

supplemented with 10% FBS at 37°C in a 5% C0 2 . Cells were co-transfected with pc3.1 LacTl 
and pc3.1 PTM2, or pc3.1 LacT2 and pc3.1 PTM1, using Lipofectamine Plus (Life 
Technologies,Gaithersburg, MD) according to the manufacturer's instructions. 24 hours post- 
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transfection, the cells were harvested; total RNA was isolated and RT-PCR was performed using 
specific primers for the target and PTM molecules, p-galactosidase activity was also monitored 
by staining the cells using a p-gal staining kit (Invitrogen, San Diego. CA). 

7.2. RESULTS 

7.2.1. THE lacZ SPLICE TARGET OS-SPLICES EFFICIENTLY 
TO PRODUCE FUNCTIONAL p -GALACTOSE) ASE 

[00142] To test the ability of the splice target pre-mRNA to c/s-splice efficiently, pc3.1 

lacYl was transfected into 293 T cells using Lipfectamine Plus reagent (Life 
Technologies,Gaithersburg, MD) followed by RT-PCR analysis of total RNA. Sequence 
analysis of the ci's-spliced RT-PCR product indicated that splicing was accurate and occurred 
exactly at the predicted splice sites (Fig. 12B). In addition, accurate cis-splicing of the target 
pre-mRNA molecule results in formation of a mRNA capable of encoding active p-galactosidase 
which catalyzes the hydrolysis of p-galactosidase, i.e., X-gal, producing a blue color that can be 
visualized under a microscope. Accurate c/s-splicing of the target pre-mRNA was further 
confirmed by successfully detecting p-galactosidase enzyme activity. 

[00143] Repair of defective lacZ target 2 pre-mRNA by trans-splicing of the functional 3 ! 

lacZ fragment (PTM1) was measured by staining for p-galactosidase enzyme activity. For this 
purpose, 293T cells were co-transfected with lacZ target 2 pre-mRNA (containing a defective 3' 
fragment) and PTM1 (contain normal 3' lacZ sequence). 48 hours post-transfection cells were 
assayed for P-galactosidase enzyme activity. Efficient trans-splicing of PTM1 into the lacZ 
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target 2 pre-mRNA will result in the production of functional p-galactosidase activity. As 
demonstrated in Figure 1 1B-E, trans-splicing of PTM 1 into lacZ target 2 results in restoration of 
p-galactosidase enzyme activity up to 5% to 10% compared to control. 

7.2.2. TARGETED ™Atf-SPLICING BETWEEN 
THE lacZ TARGET PRE-mRNA and PTM2 

[00144] To assay for /ra^s-splicing, lacZ target pre-mRNA and PTM2 were transfected 

into 293 T cells. Following transfection, total RNA was analyzed using RT-PCR. The following 
primers were used in the PCR reactions: /acZ-TRl (lacZ 5' exon specific) and HCGR2 (pHCGR 
exon 2 specific). The RT PCR reaction produced the expected 195 bp trans-spliced product ( 
Fig. 11, lanes 2 and 3) demonstrating efficient trans-splicing between the lacL target pre-mRNA 
and PTM 2. Lane 1 represents the control, which does not contain PTM 2. 
[00145] The efficiency of the /raws-splicing was also measured by staining for p- 

galactosidase enzyme activity. To assay for trans-splicing, 293T cells were co-transfected with 
lacZ target pre-mRNA and PTM 2. 24 hours post-transfection, cells were assayed for p- 
galactosidase activity. If there is efficient trans-splicing between the target pre-mRNA and the 
PTM, a chimeric mRNA is produced consisting of the 5' fragment of the lacZ target pre-mRNA 
and pHCG6 exon 2 is formed which is incapable of coding for an active p-galactosidase. 
Results from the co-transfection experiments demonstrated that /raws-splicing of PTM2 into 
lacZ target 1 resulted in the reduction of P-galactosidase activity by compared to the control. 
[00146] To further confirm that trans-splicing between the lacZ target pre-mRNA and 

PTM2 is accurate, RT-PCR was performed using 5' biotinylated lacZ-TRl and non-biotinylated 
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HCGR2 primers. Single stranded DNA was isolated and sequenced directly using HCGR2 
primer (HCG exon 2 specific primer). As evidenced by the sequence of the splice junction, 
trans-splicing occurred exactly as predicted between the splice sites (Fig. 12A and 12B), 
confirming that a conventional pre-mRNA can be invaded by an engineered PTM during 
splicing, and moreover, that this reaction is precise. 

8. EXAMPLE: CORRECTION OF THE CYSTIC FIBROSIS 
TRANSMEMBRANE REGULATOR GENE 

[00147] Cystic fibrosis (CF) is one of the most common genetic diseases in the world. 

The gene associated with CF has been isolated and its protein product deduced (Kerem, B.S. et 
al., 1989, Science 245:1073-1080; Riordan et al., 1989, Science 245: 1066-1 073 ;Rommans, et al., 
1989, Science 245:1059-1065). The protein product of the CF associated gene is referred to as 
the cystic fibrosis trans-membrane conductance regulator (CFTR). The most common disease- 
causing mutation which accounts for ~70% of all mutant alleles is a deletion of three nucleotides 
in exon 10 that encode for a phenylalanine at position 508 (AF508). The following section 
describes the successful repair of the cystic fibrosis gene using spliceosome mediated trans- 
splicing and demonstrates the feasibility of repairing CFTR in a model system. 

8.1 MATERIALS AND METHODS 

8.1.1. PRE-TRANS-SPLICING MOLECULE 
[00148] The CFTR pre-trans-splicing molecule (PTM) consists of a 23 nucleotide binding 

domain complimentary to CFTR intron 9 (3 ? end, -13 to -31), a 30 nucleotide spacer region (to 
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allow efficient binding of spliceosomal components), branch point (BP) sequence, 
polypyrimidine tract (PPT) and an AG dinucleotide at the 3 ! splice site immediately upstream of 
the sequence encoding CFTR exon 10 (wild type sequence containing F508). This initial PTM 
was designed for maximal activity in order to demonstrate /raws-splicing; therefore the PTM 
included a UACUAAC yeast consensus BP sequence and an extensive PPT. An 18 nucleotide 
HIS tag (6 histamine codons) was included after wild type exon 10 coding sequence to allow 
specific amplification and isolation of the /raws-spliced products and not the endogenous CFTR. 
The oligonucleotides used to generate the two fragments included unique restriction sites. (Apal 
and PstI, and PstI and NotI, respectively) to facilitate directed cloning of amplified DNA into the 
mammalian expression vector pcDNA3.1. 

8.1.2. THE TARGET CFTR PRE-mRNA MINI-GENE 
[00149] The CFTR mini-gene target is shown in Figure 13 and consists of CFTR exon 9 ; 

the functional 5 1 and 3 1 regions of intron 9 (260 and 265 nucleotides from each end, respectively); 
exon 10 [AF508]; and the 5' region of intron 10 (96 nucleotides). In addition, as depicted in 
Figure 16, a mini-target gene comprising CFTR exons 1-9 and 10-24 can be used to test the use 
of spliceosome mediated /raws-splicing for correction of the cystic fibrosis mutation. Figure 
17, shows a double splicing PTM that may also be used for correction of the cystic fibrosis 
mutation. As shown, the double splicing PTM contains CFTR BD intron 9, a spacer, a branch 
point, a polypyrimidine tract, a 3* splice site, CFTR exon 10, a spacer, a branch point, a 
polypyrimidine tract, a 5' splice site and CFTR BD exon 10. 
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8.1.3. OLIGONUCLEOTIDES 
[00150] The following oligonucleotides were used to create CFTR PTM: 

Forward CF3 

ACCT GGGCCC ACC CAT TAT TAG GTC ATT AT CCGCGG AAC ATT ATA 
Apalsite. Intron 9 CFTR, -12 to -34. (SEQ ID NO: 40) 



Reverse CF4 

ACCT CTGCAGGTGA CC CTG CAG GAA AAA AAA GAA G (SEQIDNO:41) 
PstI BstEI PPT 



Forward CF5 

ACCT CTGCAG ACT TCA CTT CTA ATG ATG AT (SEQIDNO:42) 
PstI. Exon 10 CFTR, +1 to +24 



Reverse CF6 

ACCT GCGGCCGC CTA ATG ATG ATG ATG ATG ATG CTC TTC TAG TTG GCA 

TGC 

Not I. Stop Polyhistamine tag Exon 1 0 CFTR, +1 5 to +1 32 

(SEQ ID NO: 43) 



[00151] The following nucleotides were used to create the CFTR TARGET 

pre-mRNA mini gene (Exon 9 + mini-Intron 9 + Exon 10 + 5' end Intron 10): 



Forward CF18 

GACCT CTCGAG GGA TTT GGG GAA TTA TTT GAG (SEQ ID NO: 44) 
Xhol Exon 9 CFTR, 1 to 2 1 . 

Reverse CF19 

CTGACCT GCGGCCGC TAC AGT GTT GAA TGT GGT GC (SEQ ID NO: 45) 
Notl. Intron 9 5' end. 



Forward CF20 

CTGACCT GCGGCCGC CCA ACT ATC TGA ATC ATG TG (SEQ ID NO: 46) 
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Notl. Intron 9 3* end. 

Reverse CF21 

GACCT CTTAAG TAG ACT AAC CGA TTG AAT ATG (SEQ ID NO: 47) 
Aflll Intron 10 5' end. 

[00152] The following oligonucleotides were used for detection of trans-spliced products: 

Reverse Bio-His 

CTA ATG ATG ATG ATG ATG ATG (SEQ ID NO: 48) 
Stop. Polyhistidine tag (5' biotin label). 

Reverse Bio-His(2) 

CGC CTA ATG ATG ATG ATG ATG (SEQ ID NO: 49) 
3' UT Stop. Polyhistidine tag (5* biotin label). 

Forward CF8 

CTT CTT GGT ACT CCT GTC CTG (SEQ ID NO: 50) 
Exon 9 CFTR. 



Forward CF18 

GACCT CTCGAG GGA TTT GGG GAA TTA TTT GAG (SEQ ID NO: 51) 
Xhol. Exon 9 CFTR. 

Reverse CF28 

AAC TAG AAG GCA CAG TCG AGG (SEQ ID NO: 52) 
Pc3.1 vector sequence (present in PTM 3' UT but not target). 



8.2. RESULTS 

[00153] The PTM and target pre-mRNA were co-transfected in 293 embryonic kidney 

cells using lipofectamine (Life Technologies,Gaithersburg, MD). Cells were harvested 24 h post 
transfection and RNA was isolated. Using PTM and target-specific primers in RT-PCR 
reactions, a trans-spliced product was detected in which mutant exon 10 of the target pre-mRNA 
was replaced by the wild type exon 10 of the PTM (Figure 14). Sequence analysis of the trans- 
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spliced product confirmed the restoration of the three nucleotide deletion and that splicing was 
accurate, occurring at the predicted splice sites (Figure 15), demonstrating for the first time RNA 
repair of the cystic fibrosis gene, CFTR (Mansfield et al., 2000, Gene Therapy 7:1885-1895). 

9. EXAMPLE: DOUBLE- 77^4A^-SPLICING 
[00154] The following example demonstrates accurate replacement of an internal exon by 

a double-/ra«s-splicing between a target pre-mRNA and a PTM RNA containing both 3' and 5' 
splice sites leading to production of full length functionally active protein. 
[00155] As described herein, any pre-mRNA can be reprogrammed by providing a trans- 

reactive RNA molecule containing either a 3 '-splice site, a 5'-splice site or both. The following 
example describes successful targeting and replacement of a single internal exon utilizing pre- 
rrans-splicing molecules (PTMs) containing both the 5' and 3' splice sites. Such PTMs can 
promote two frans-splicing reactions with the intended target gene mediated by the 
splicesome(s). To test this mechanism, a splicing lacZ model target gene consisting of lacZ 5' 
"exon" - CFTR mini-intron 9 - CFTR exon 10 (AF508) - CFTR mini-intron 10 followed by lacZ 
3' "exon" was created. In this target transcript, a 124 bp central portion of the p-galactosidase 
ORF was substituted by exon 10 (AF508) of CFTR, thus it produces non- functional protein. A 
PTM consisting of the missing 124 bp lacZ "mini-exon" and a 5' and 3' /raws-splicing domain 
containing binding domains (BDs) complementary to the target introns and exons was created. 
Transfection of HEK 293T cells with either target alone or PTM alone showed no detectable 
levels of p-gal activity. In contrast, 293T cells transfected with target plus PTM produced 
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substantial levels of P-gal activity indicating the restoration of protein function. The accuracy of 
trans-splicing between the target and PTM was confirmed by sequencing the appropriate RT- 
PCR product, which revealed the predicted internal exon substitution. The feasibility of this 
approach in a disease model was tested by replacing the CFTR AF508 exon 10 with normal exon 
10 containing F508 in cystic fibrosis. These results demonstrate that a trans-splicing technology 
can be easily adapted to correct many of the genetic defects whether they are associated with the 
5 ! exon or 3 ! exon or any internal exon of the gene. 

[00156] Figure 18 is a schematic of a model lacZ target consisting of lacZ 5 f exon - CFTR 

mini-intron 9 - CFTR exon 10 (delta 508) - CFTR min-intron 10 followed by the lacZ 3' exon. 
In this target, a 124 bp central portion of the lacZ gene is substituted with CFTR exon 10 which 
has a mutation at position 508 (delta 508). The pre-mRNA target undergoes normal c/s-splicing 
to produce an mRNA consisting of lacZ 5' exon - CFTR exon 10 (delta 508) followed by the 
lacZ 3* exon. Because of the disruption in p-galactosidase ORF it produces truncated proteins 
which are non- functional. 

[00157] To restore P-gal function by double-^rara-splicing, three PTMs were created 

consisting of the missing 124 bp lacZ "mini-exon" and a 5' and 3 f /ra/w-splicing domain 
containing binding domains complementary to the target introns and exons as shown in Figure 
19. These PTMs have an 120 bp 3' binding domain (complementary to intron 9) from PTM24 
(see below) used in 3 ! exon replacement, spacer sequence, yeast branch point, polypyrimidine 
tract, 3 ! acceptor AG dinucleotide, lacZ "mini-exon", 5 1 splice site, spacer sequence followed by 
the 5 f binding domain. These PTMs differ only in their 5 1 binding domain sequences. DSPTM5 
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has a 27 bp BD which is complementary to intron 10 and blocks just the 5' splice site of the 
target. DSPTM6 has 120 bp 5' BD and covers both 5' and 3' splice sites of the target, while, 
DSPTM7 has 260 bp BD which masks both the splice sites (5' and 3') and also covers the entire 
exon of the target. 

[00158] A schematic representation of a double-fra/zs-splicing reaction showing the 

binding of DSPTM7 with DSCFT1.6 target pre-mRNA is shown in Figure 20. 3' BD: 120 bp 
binding domain complementary to mini-intron 9; 5' BD (260 bp); second binding domain 
complementary to mini-intron 10 and exon 10. ss: splice sites; BP: branch point, and PPT: 
polypyrimidine tract. 

[00159] The important structural elements of DSPTM7 (Figure 21) are as follows: 

(1) 3' BD (120 BP) (SEP ID NO:69) : 

GATTCACTTGCTCCAATTATCATCCTAAGCAGAAGTGTATATTCTTATT 
TGTAAAGATTCTATTAACTCATTTGATTCAAAATATTTAAAATACTTCCTGTTT 
CATACTCTGCTATGCAC 

(2) Spacer sequences (24 bp) (SEP ID NP:70) : AACATTATTATAACGTTGCTCGAA 

(3) Branch point, pvrimidine tract and acceptor splice site(SEP ED NG:71) : 

3'ss 

BP Kpn 1 PPT ECORV i lacZ mini-exon 

TACTAAC T GGTACC TCTTCTTTTTTTTTT GATATC CTGCAG | GGC GGC | 
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(4) 5' donor site and 2 n d spacer sequence (SEP ID NO:72) : 

5* ss 

lacZ mini-exon >L 

| TGA ACG | GTAAGT GTTATCACCGATATGTGTCTAACCTGATTCGGGCCTTC 
GATACGCTAAGATCCACCGG 

(5) 5' BP (260 BPVSEO ID NO:73) : 

TCAAAAAGTTTTCACATAATTTCTTACCTCTTCTTGAATTCATGCTTTG 
ATGACGCTTCTGTATCTATATTCATCATTGGAAACACCAATGATTTTTCTTTAA 
TGGTGCCTGGCATAATCCTGGAAAACTGATAACACAATGAAATTCTTCCACT 
GTGCTTAAAAAAACCCTCTTGAATTCTCCATTTCTCCCATAATCATCATTACA 
ACTGAACTCTGGAAATAAAACCCATCATTATTAACTCATTATCAAATCACGC 

[00160] To determine whether the restoration of (3-gal function is RNA trans-splicing 

mediated, the mutants are depicted in Figure 22. DSPTM8 is a 3' splice mutant in which the 3' 
splice elements such as BP, polypyrimidine tract and the 3' acceptor AG dinucleotides were 
deleted and replaced with random sequences (SEQ ID NO:85). This PTM still has 3* and 5' 
binding domains and the functional 5' splice site. PTM29 lacks the 2 nd binding domain + 5' ss 
but still has the 3' binding domain 3' splice site, while PTM30 lacks the 1 st binding domain + 3' 
splice site but has the functional 5' splice site and 2 nd binding domain. 
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[00161] To examine the double-fraws-splicing mediated restoration of p-gal function, 

293T cells were either transfected with 2 jug of target or PTM alone or co-transfected with 2 //g 
of target + 1.5 Aig of PTM using Lipofectamine Plus reagent. 48 hrs. after transfection, total 
RNA was isolated and analyzed by RT-PCR using K1-1F and Lac-6R primers. These primers 
amplify both cis- and /raws-spliced products in a single reaction which were identified based on 
the size. The c/s-spliced product is 295 bp in size while the trans-spliced product is 230 bp in 
size. To confirm that ^raws-splicing between DSPTM7 and DSCFT1.6 pre-mRNA is precise, 
RT-PCR amplified products were excised, re-amplified using K1-2F and Lac-6R primers and 
sequenced directly using K1-2F or Lac-6R primers. As shown in Figure 23 trans-splicing 
occurred exactly at the predicted splice sites, confirming the precise internal exon substitution by 
two /raws-splicing events (SEQ ID NO: 86, 87). 

[00162] The repair of defective lacZ pre-mRNA by double trans-splicing events and 

subsequent production of full-length p-gal protein was investigated in co-transfection assays. 
293T cells were co-transfected with DSCFT1.6 target and DSPTM7 expression plasmids, as well 
as with DSCFT1.6 target or DSPTM7 alone as controls. Western blot analysis of total cell 
lysates using polyclonal anti-p-galactosidase antiserum specifically recognized a - 120 kDa 
protein only in cells co-transfected with DSCFT1.6 target + DSPTM7 plasmids (Fig. 24, lanes 3 
and 4) but not in cells transfected with either DSCFT1.6 target (Lane 1) or DSPTM7 plasmid 
alone (Lane 2). Similarly, no full-length protein was detected in cells co-transfected with 
DSCFT1.6 target + 3' splice mutant (Lane 5 and 6) or PTM29 or 30 in which either 3' trans- 
splicing domain or 5* trans-splicing domains has been deleted (Lane 7). In addition, the 120 kDa 
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protein band co-migrated with the full-length functional P-gal produced using lacZ-Tl plasmid 
(positive control, data not shown). These results not only confirmed the production of full-length 
protein by double-/raws-splicing between the target and PTM but also demonstrated that both the 
3 ! splice site and 5 f splice sites are essential for this process. 

[00163] To determine whether the full-length protein produced by double-fra/is-splicing 

between the target pre-mRNA and DSPTM7 RNA is functionally active, 293T cells were co- 
transfected with DSCFT1.6 targeted + one of the double splicing PTMs 5, 6 or 7 expression 
plasmids, or transfected with DSCFT1.6 target or DSPTM7 alone. Total cell extracts were 
prepared and assayed for p-gal activity using ONPG assay (Invitrogen). p-gal activity in extracts 
prepared from cells transfected with either DSCFT1.6 target or DSPTM7 alone was almost 
identical to the background levels detected in mock transfection (Fig. 25). In contrast, 293T cells 
co-transfected with DSCFT1.6 target and DSPTM7 produced - 21 fold higher levels of P-gal 
activity over the background (Fig. 25). These results confirmed the accurate double-*ra«s- 
splicing between the target pre-mRNA and PTM RNA and production of the full-length 
functional protein. 

[00164] To confirm that restoration of p-gal activity by double-frans-splicing reaction is 

absolutely depended on the presence of both 3 1 and 5' splice sites of the PTM, we constructed 
several mutants: (a) DSPTM8, is identical to DSPTM7 except the functional 3' spice elements 
(branch point, polypyrimidine tract and the 3' acceptor AG dinucleotides) were deleted and 
substituted with random sequences (see Fig. 22 for details); (b) PTM29 lacks 5' splice site as 
well as the 5' binding domain but has the 3' binding domain and 3' splice site, and (c) PTM30 
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lacks 3' binding domain and 3' splice site but has the 5' splice site and 5' binding domain, p-gal 
activity in extracts prepared from cells transfected with either DSCFT1.6 target or DSPTM7 
alone was almost identical to the background levels detected in mock transfection (Fig. 26). 
Similarly, no significant increase in p-gal activity was detected in cells transfected with either 
DSPTM8 alone (3 ? splice site mutant) or co-transfection of DSCFT1.6 target + one of the above 
mutant PTMs. On the other hand, cells co-transfected with DSCFT1.6 target and DSPTM7 with 
functional 3' and 5 1 splice sites produced substantial levels of p-gal activity over the background 
(Fig. 26). These results confirmed the requirement of both splice sites in the double-splicing 
PTM and also eliminated the possibility that restoration of p-gal activity was due to 
complementation between the truncated proteins (Fig. 26). 

[00165] Different concentrations of the target and PTM were co-transfected and analyzed 

for p-gal activity restoration. As expected, 293T cells co-transfected with DSCFT1.6 target + 
DSPTM7 showed substantial levels of P-gal activity (~ 30 fold) over the controls. Increasing the 
concentrations of the PTM by 2 and 3 fold did increase the level of P-gal activity, but not 
significantly (Fig. 27). These results further confirmed the double-frans-splicing mediated 
restoration of P-gal enzyme function. 

[00166] The specificity of double-frans-splicing reaction was examined by constructing a 

non-specific target (DSHCGT1.1) which is similar to that of specific target (DSCFT1.6) but has 
pHCG intron 1 - pHCG exon 2 and PHCG intron 2 instead of CFTR mini-intron 9 - CFTR exon 
10 (delta 508) and CFTR mini-intron 10 (Fig. 28). RT-PCR analysis of the total RNA isolated 
from cells transfected with either DSHCGT1.1 (non-specific target) alone or in combination 
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DSPTM7 (targeted to DSCFT1.6 target) failed to produce the expected 314 bp double-fraws- 
spliced product. On the other hand, RT-PCR analysis of the total RNA prepared from cells co- 
transfected with specific target + PTM produced the expected 314 pb product. This was further 
confirmed by p-gal activity assay of the total cellular extract. The level p-gal activity detected in 
cells transfected with non-specific target alone or in combination with DSPTM7 targeted to 
DSCFT1.6 target was almost identical to the background level. In contrast substantial levels of 
P-gal activity was detected in cells co-transfected with specific target (DSCFT1.6) + DSPTM7 
(Fig. 27). These results confirmed that the double-^raws-splicing is highly specific. 
[00167] The repair model in Fig. 30 shows a portion of a target CFTR pre-mRNA 

consisting of exons 1-9, mini-intron 9, exon 10 containing the delta 508 mutation, mini-intron 10 
and exons 11-24 (Fig. 30). The PTM shown in the figure consists of exon 10 coding sequences 
(containing codon 508) and two trans-splicing domains each with its own splicing elements 
(acceptor and donor sites, branchpoint and pyrimidine tract) and a binding domain 
complementary to intron 9 splice site, part of exon 10 (5 1 and 3' ends) and intron 10 5' splice site 
(SEQ ID NO:88)(Fig. 31 (DS-CF1)). Exon 10 of the PTM also has modified codon usage 
throughout to reduce antisense effects between exon 10 of the PTM and it's own binding 
domains and for PTMs that have binding domains which are complementary to exon sequences 
(Fig. 31). A double-frans-splicing event between the PTM and target should produce a repaired 
full-length mRNA. 

[00168] Fig. 32 shows the sequence of a single PCR product showing target exon 9 

correctly spliced to PTM 20 exon 10 (with modified codons) (upper panel) (SEQ ID NO: 89), 
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codon 508 in exon 10 of the PTM (middle panel) (SEQ ID NO:90)and PTM exon 10 correctly 
spliced to target exon 1 1 (lower panel) (SEQ ID NO:91). The sequence of a repaired target was 
generated by RT-PCR followed by PCR. 

10. EXAMPLE: TRANS-SPLICING REPAIR OF THE 

CYSTIC FIBROSIS GENE USING A PTM 
THAT CAN PERFORM 5 1 EXON REPLACEMENT 

The key advantage of using 5' exon replacement for gene repair are 

(a) it permits replacement of the 5 f portion of a gene 

(b) the construct requires less sequence and space than a full-length gene 

(c) PTMs can be produced that lack a polyA signal which should prevent PTM 
and (d) the 5' end can be modified to increase translation. 

10.1 MATERIALS AND METHODS 

10.1.1 PLASMID CONSTRUCTION 
[00170] The CFTR coding sequences (exons 1-10) for PTM30 were generated by PCR 

using a partial cDNA plasmid template (61160; American Type Culture Collection, Manassas, 
VA). The /ra«5-splicing domain (TSD) [including the binding domain, spacer sequence, 
polypyrimidine tract (PPT), branchpoint (BP) and 3* splice site] was generated from a PCR 
product (using an existing plasmid template) and by annealing oligonucleotides. The different 
fragments (the TSD and coding sequences) were then cloned into pcDNA3.1(-) using appropriate 
restriction sites. Oligodeoxynucleotide primers were procured from Sigma Genosys (The 
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Woodlands, TX). All PCR products were generated with either KEDTaq (Sigma, St. Louis, 
MO), or cloned Pfu (Stratagene, La Jolla, CA) DNA Polymerase. PCR primers for amplification 
contained restriction sites for directed cloning. PCR products were digested with the appropriate 
restriction enzymes and cloned into the mammalian expression plasmid pc3.1DNA(-) 
(Invitrogen, Carlsbad, CA). 

10.1.2 CELL CULTURE AND TRANSFECTIONS 

[00171] Constructs were cotransfected in human embryonic kidney (HEK) 293T or 

293 cells (1.25 x 10 6 cells per 60 mm poly-d-lysine coated dish) using LipofectaminePlus (Life 
Technologies, Gaithersburg, MD) and the cells were harvested 48 h after the start of transfection. 
Total RNA was isolated as described in the manufacturers instructions (Epicenter Technologies, 
Inc.). HEK 293T cells were grown in Dulbecco's Modified Eagle's Medium (Life Technologies) 
supplemented with 10% v/v fetal bovine serum (Hyclone, Inc., Logan, UT). All cells were kept 
in a humidified incubator at 37 °C and 5% CO2. 

10.1.3 REVERSE TRANSCRff TION-POLYMERASE 
CHAIN REACTION (TR-PCR) 

[00172] RT-PCR was performed using an EZ-RT-PCR kit (Perkin-Elmer, Foster, CA). 

Each reaction contained 0.03 to 1.0 jug of total RNA and 80 ng of a 5* and 3' specific primer in a 
40 /A reaction volume. RT-PCR products were electrophoresed on 2% Seaken agarose gels. 
The PTM- and target-specific oligonucleotides used to generate £ra«s-spliced products are 
5'-CGCTGGAAAAACGAGCTTGTTG-3 f (primer CF93) ( SEQ ID NO:74) and 
5 f -ACTCAGTGTGATTCCACCTTCTC-3' (primer CF111) ( SEQ ID NO:75) , respectively. 
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The PTM- and target-specific oligonucleotides used to generate c/s-spliced products were CF1 
and CF93. The sequence of oligonucleotide CF1 is 

5'-GACCTCTGCAGACTTCACTTCTAATGATGATTATGG-3' ( SEQ ID NO:76). 
[00173] The repair model in Fig. 33 shows a portion of a target CFTR pre-mRNA 

consisting of exons 1-9, mini-intron 9, exon 10 containing the delta 508 mutation, mini-intron 10 
and exons 11-24 (Fig. 33). The PTM shown in the figure consists of exon 1-10 coding 
sequences (containing codon 508) and a fraws-splicing domain with its own splicing elements 
(donor site, branchpoint and pyrimidine tract) and a binding domain. Several PTMs have been 
constructed with different binding domains. Three examples are shown in Figure 34. In Fig. 
34A the binding domain is complementary to the splice site of intron 9 and part of exon 10 (3* 
end; CF-PTM 11). In Fig. 34B the PTM has an extended binding domain which also covers the 
5' end of exon 10 and the 3' splice site of intron 9 (CF-PTM 20). In the last example (Fig. 34C) 
the binding domain is the same as that shown in panel B except the binding domain extends the 
full-length of exon 10 (CF-PTM 30). In the latter case the PTM exon 10 has modified codon 
usage to reduce antisense effects with it's own binding domain (Fig. 34). Further examples of 
binding domains are shown in Figure 35. 

[00174] Figure 36 shows the sequence of cis- and frans-spliced products. The top panel of 

Fig. 36A shows target exon 10 with it's three missing nucleotides (CTT) (SEQ ID NO:93), whilst 
the lower panel shows exon 10 and 11 of the target correctly spliced together (SEQ ID NO:94). 
Figure 36B is a partial sequence of a single PCR product showing the modified codons in exon 
10 of the PTM (upper panel) (SEQ ID NO:95), codon 508 in exon 10 of the PTM (middle panel) 
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(SEQ ID NO:96), and PTM exon 10 correctly spliced to target exon 11 (lower panel) (SEQ ID 
NO:97), indicating that trans-splicing is accurate. The sequence of the repaired target was 
generated by RT-PCR followed by PCR. 

1 1 . EXAMPLE: PTMs WITH A LONG BINDING DOMAIN, 
WHICH MAY BE DISCONTINUOUS, HAVE 
INCREASED m^/VS-SPLICING EFFICIENCY 
AND SPECIFICITY 

11.1. MATERIALS AND METHODS 

11.1.1. CELL CULTURE 
[00175] Human embryonic kidney cells (293 or 293T) were from the University of North 

Carolina tissue culture facility at Chapel Hill (Chapel Hill, NC). Cells were maintained at 37°C 
in a humidified incubator with 5% CO2 in Dulbecco's modified Eagle's medium (Life 
Technologies, Bethesda, MD) supplemented with 10% v/v fetal bovine serum (Hyclone, Logan, 
UT). Cells were passaged every 2-3 days using 0.5% trypsin and re-plated at the desired density. 
Stable cells, expressing an endogenous mutant lacZ pre-mRNA (lacZCF9) were maintained in 
the presence of 0.5 mg/ml G418 (Calbiochem, San Diego, CA). 

11.1.2. RECOMBINANT PLASMIDS 
[00176] Targets: pc3.11acZCF9, pc3.11acZCF9m, and pc3.11acZHCGlm. pc3.11acZCF9 

encodes for a normal lacZ pre-mRNA was constructed using lacZ coding sequences nucleotides 
1-1788 as 5' exon, CFTR mini-intron 9 followed by lacZ coding sequences nucleotides 1789- 
3174 as 3' exon. This is similar to pc3.11acZ-T2 construct but without stop codons in the lacZ 3* 
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exon and has CFTR mini-intron 9 instead of (5HCG6 intron 1 (Fig. 37A). CFTR mini-intron 9 
was PCR amplified using plasmid T5 as template and primers CFIN-9F (5- 
CTA GGATCC CGTTCTTTTGTTCTTCACT ATTAA) ( SEQ ID NO:77) and CFIN-9R (5 1 - 
CTAGGGTTACCGAAGTAAAACCATACTTATTAG, restriction sites underlined) ( SEQ ID 
NO:78), digested with BamH I and BstE II and cloned in place of BHCG6 intron 1 of pc3.11acZ- 
T2 plasmid. pc3.11acZCF9m expresses a defective lacZ pre-mRNA and is identical to 
pc3.11acZCF9 but contains two in-frame non-sense codons in the 3' exon (Fig. 37A). 
pc3.11acZHCGlm is a chimeric target, which includes the lacZ 5 1 exon followed by intron 1 and 
exon 2 of PHCG6. This is similar to pc3.11acZCF9m except that it contains exon 2 of pHCG6 in 
place of mutant lacZ 3' exon. pHCG6 exon 2 was PCR amplified using pHCG6 plasmid 
(accession # X00266) as template DNA and primers HCGEx-2F (5'- 
GCAT GGTTACC CTGCAGGGGCTGCTGCTGTTGCTG) ( SEQ ID NO:79) and HCGEx-2R 
(S'-CTG AAAGCTT GTTAACCAGCTCACCATGGTGGGGCAG, restriction sites underlined) 
(SEQ ID NO:80) digested with BstE II and Hind III and cloned in place of the lacZ 3' exon of 
pc3.11acZCF9m. Plasmid pcDNA3.1/HisB//acZ (Invitrogen, Carlsbad, CA) was used as DNA 
template to produce 5' and 3 ! lacZ exons. The lacZ 5' exon is 1788 bp long, has an ATG 
initiation codon, lacZ 3 ? exon (without stop codons) is 1385 bp long and has a transcription 
termination signal at the end of the 3' exon. CFTR mini-intron 9 and pHCG6 intron 1 are 548 bp 
and 352 bp in size, respectively, and both have 5' and 3 1 splice signals. Exon 2 of pHCG6 is 162 
bp long and has a transcription termination signal at the end of the exon. 
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[00177] Pre-trans-splicing Molecules (PTMs): PTM-CF14 is an identical version of 

pcPTMl with minor modifications in the trans-splicing domain (Fig. 37B). PTM-CF14 is a 
linear version and contains a 23 bp antisense binding domain (BD) 
(5 ? -ACCCATCATTATTAGGTCATTAT) ( SEQ ID NO:81) complementary to CFTR mini- 
intron 9, 18 bp spacer, a canonical branch point sequence (UACUAAC; BP) and an extended 
polypyrimidine tract (PPT) followed by normal lacZ 3* exon. PTM-CF22, PTM-CF24, PTM- 
CF26 and PTM-CF27 are identical to PTM-CF14 except they differ in length of the BD 
(Fig. 37B). sPTM-CF18 has a 32 bp BD, sPTM-CF22 and sPTM-CF24 contain the same BD as 
PTM-CF22 and PTM-CF24, respectively. In these PTMs, the binding domains were modified to 
create intra-molecular stem-loop structure ("safety") to mask the 3' splice-site of the PTM. 
Different binding domains were produced by PCR amplification using specific primers (with 
unique Nhe I and Sac II sites) and a plasmid containing CFTR mini-intron 9 as template. PCR 
products were digested with Nhe I and Sac II and cloned into a PTM plasmid consisting of 
spacer sequences, 3 f splice elements (BP, PPT and acceptor AG dinucleotide) followed by a 
normal lacZ 3 1 exon. 

11.1.3. TRANSFECTION OF PLASMID DNAs INTO 293T CELLS 
[00178] The day before transfection, 1 x 10 6 293T cells were plated on 60 mm plates 

coated with Poly-D-lysine (Sigma, St. Louis, MO) to enhance the adherence of cells and grown 
for 24 hr at 37 °C. Cells were transfected with expression plasmids using LipofectaminePlus 
reagent according to standard protocols (Life Technologies, Bethesda, MD). In a typical co- 
transfection, 2 y.% of pc3.11acZCF9m target and 1.5 /^g of PTM expression plasmids were 
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transfected into cells and for controls (target and PTM alone transfections) total DNA 
concentration was normalized to 3.5 /zg with pcDNA3.1 vector. 

[00179] Forty-eight hours after transfection the plates were rinsed with PBS, cells 

harvested and total RNA or DNA was isolated using MasterPure RNA/DNA purification kit 
(Epicenter Technologies, Madison, WI). Contaminating DNA in the RNA preparation was 
removed by treating with DNasel, while, contaminating RNA in the DNA preparation was 
removed by digesting with RNase A at 37°C for 30-45 min. 

11.1.4. REVERSE TRANSCRIPTION-POL YMERASE 

CHAIN REACTION (RT-PCR) 

[00180] RT-PCR was performed as suggested by manufacturer using an EZ rTth RNA 

PCR kit (Perkins-Elmer, Foster City, CA). A typical reaction (50 iA) contained 25-500 ng of 
total RNA, 100 ng of 5' target specific primer (common to cis- and /raws-spliced products) (Lac- 
9F, 5 '-GATC AAATCTGTCG ATCCTTCC) ( SEQ ID NO:82) and 100 ng of 3' primer (Lac-3R, 
5'-CTGATCCACCCAGTCCCATTA, target specific primer for cw-splicing ( SEQ ID NO:83), 
and Lac-5R, 5'-GACTGATCCACCCAGTCCCAGA ( SEQ ID NO:84), PTM specific primer for 
frans-splicing), IX reverse transcription buffer (100 mM Tris-HCl, pH 8.3, 900 mM KCL with 
1 mM MnCl 2 ), 200 a<M dNTPs and 10 units of rTth DNA polymerase. RT reactions were 
performed at 60°C for 45 min. followed by 30 sec pre-heating at 94°C and 25-35 cycles of PCR 
amplification at 94°C for 18 sec, annealing and extension at 60°C for 1 min followed by a final 
extension at 70 °C for 7 min. The reaction products were analyzed by agarose gel 
electrophoresis. 
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11.1.5. PROTEIN PREPARATION AND B-GAL ASSAY 
[00181] Total cellular protein from cells transfected with expression plasmids was isolated 

by freeze thaw method and assayed for p-galactosidase activity using a P-gal assay kit 
(Invitrogen, Carlsbad, CA). Protein concentration was measured by the dye-binding assay using 
Bio-Rad protein assay reagents (BIO-RAD, Hercules, CA). 

11.1.6. WESTERN BLOT 

[00182] About 5-25 jug of total protein was electrophoresed on a 7.5% SDS-PAGE gel and 

electroblotted onto PVDF-P membrane (Millipore). After blocking for 1 hr at room temperature 
(blocking buffer: 5% dry milk and 0.1% Tween-20 in IX PBS), the blot was incubated with a 
1:2500 dilution of polyclonal rabbit anti-p-galactosidase antibody for 1 hr at room temperature 
(Research Diagnostics Inc. NJ), washed 3x with blocking buffer and then incubated with a 
1:5000 diluted anti-rabbit HRP conjugated secondary antibody. After incubating at room 
temperature for 1 hr, it was washed 3x in blocking buffer and developed using ECLPlus Western 
blotting reagents (Amersham Pharmacia Biotech, Piscataway, NJ). 

11.1.7. IN SITU B-GAL STAINING 

[00183] Cells were monitored for the expression of functional p-galactosidase using a (3- 

gal staining kit (Invitrogen, Carlsbad, CA). The percentage of P-gal positive cells were 
determined by counting stained vs. unstained cells in 5-10 randomly selected fields. 
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1 1.1.8. SELECTION OF NEOMYCIN RESISTANT CLONES 
EXPRESSING AN ENDOGENOUS DEFECTIVE lacZ 
PRE-mRNA TARGET 

[00184] On day 1, 1 x 10 6 293 cells were plated on 60 mm plates and grown for 24 hr at 

37°C On day 2, the cells were transfected with 2 jug of pc3.11acZCF9m using 
LipofectaminePlus transfection reagent as described above. 48 hr post-transfection, cells were 
split (1:20 ratio) and grown in media containing 0.5 mg/ml G418. At the end of 2 weeks, 
neomycin resistant colonies were selected, pooled, expanded and maintained constantly in the 
presence of G418. 

11.2. RESULTS 

[00185] A model system was developed that permits facile and versatile analysis of 

spliceosome mediated RNA /raws-splicing in cells. The bacterial lacZ gene was split with a 
truncated intron 9 from the Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) 
gene (Figure 37A). This split lacZ gene, when introduced into human 293T cells, directed the 
synthesis of a lacZ pre-mRNA that could splice properly. The open reading frame of the lacZ 
gene was mutated by insertion of two in-frame nonsense codons near the 5 1 end of the second 
exon (Figure 37A). This lacZ gene is referred to as lacZCF9m. In 293T cells, lacZCF9m directs 
the synthesis of lacZCF9m pre-mRNA, which encodes a truncated P-galactosidase (P-gal) 
protein that does not have enzymatic activity. Cells bearing the lacZCF9m gene are a model 
system for genetic disorders caused by loss of function mutations. 

[00186] Pre-rra«5-splicing molecules (PTMs) were designed to fr^ms-splice with 

lacZCF9m pre-mRNA and repair the mutation caused by the two nonsense codons. PTMs were 
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constructed with binding domains spanning 23, 91 and 153 nucleotides (nt), which we named 
PTM-CF14, PTM-CF22 and PTM-CF24 (Figure 37B). The PTM-CF24 binding domain does 
not bind 153 contiguous nt in the targeted CFTR gene intron 9, but rather creates a loop of 47 nt 
in the target in between two regions of complementary of 27 and 126 nt (Figure 37B). These 
PTMs were predicted to repair the deficiency created by lacZCF9m (Figure 37C). 
[00187] Semi-quantitative RT-PCR analysis was used to tests the efficiency of trans- 

splicing mediated by PTMs with long target binding domains. Repair of lacZCF9m transcripts 
by /raws-splicing was tested in two different ways: co-transfection of PTM and target 
(lacZCF9m) plasmids or transfection of cells that had been modified to express the target as an 
endogenous pre-mRNA. Co-transfecting plasmids encoding PTMs with the lacZCF9m plasmid 
provided a facile method for screening the former for efficiency. PTM-CF22 and PTM-CF24 
were approximately 3 -fold and 10-fold more efficient than PTM-CF14 in a semi-quantitative 
RT-PCR assay suggesting a significant improvement in mRNA repair (Figure 38). Sequencing 
of the RT-PCR products showed that /ra^s-splicing was accurate, resulting in proper ligation of 
the exons from the target and the PTM. Moreover, mutation of key as-acting elements in the 3' 
splice site of the PTMs resulted in an abrogation of /ra«s-splicing. In these and all other assays 
described herein controls were carried out to rule out recombination at the DNA level. Thus, 
repair of the lacZCF9m transcripts was a result of targeted RNA /raws-splicing. 
[00188] Transfection of PTM-CF14, -CF22 or -CF24 into 293 cells bearing an 

endogenous lacZCF9m gene confirmed that the longer target binding domains provided the 
PTMs with higher efficiency (Figure 38B). It should be noted that similar levels of RT-PCR 
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taws-splicing specific product were obtained after 30 PCR cycles and 35 cycles for PTM-CF24 
and PTM-CF14, respectively. The data therefore suggests that PTMs with long binding domains 
repaired lacZCF9m transcripts at least an order of magnitude better than previously described 
PTMs. 

[00189] More than one in ten transcripts of lacZCF9m can be repaired by /r<ms-splicing. 

Quantitative, real-time PCR was used to measure the fraction of lacZCF9m transcripts repaired 
by PTMs with long binding domains. The co-transfection assay described above was used in 
these experiments. PTM-CF14, which contains a binding domain of 23 nt, was shown to repair 
between 1.2 and 1.6% of lacZCF9m RNAs in 293T cells and 2.1% of lacZCF9m RNAs in the 
HI 299 human lung cancer cells. PTM-CF24, which has a 153 not long binding domain, was 
significantly more efficient, correcting between 12.1 and 15.2% of lacZCF9m RNAs in 293T 
cells and 19.7% in HI 299 cells. This in effect resulted in a measurable reduction in the levels of 
lacZCF9m mRNA. These data also confirmed the remarkable capability of this RT-PCR assay 
to distinguish between the products of c/s-splicing, the lacZCF9m and mRNA, and the products 
of /raws-splicing, repaired lacZCF9m mRNA. This is the first true quantification of the efficacy 
of trans-splicing mediated mRNA repair at the RNA level. These data confirm the suggestions 
of the semi-quantitative RT-PCR analysis shown above. Similar experiments were carried out 
using 293 cells that express an endogenous lacZCF9m pre-mRNA target. Consistent with the 
data shown above, PTM-CF24 was ten times more efficient than PTM-CF14, with the former 
correcting between 1.3 and 4.1% of endogenous lacZCF9m transcripts. These data confirmed 
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that increasing the length of the PTMs provided a remarkable enhancement in trans-splicing 
efficiency. 

[00190] rra/w-splicing mediated mRNA repair results in the synthesis of active 

P-galactosidase. At the cellular level, the ultimate criterion for the success of mRNA repair is 
the production of an active protein. Using a western assay it was determined that full-length P- 
gal was produced as a result of trans-splicing . Full-length P-gal was not observed following 
transfection of 293T cells with plasmids encoding lacZCF9m or PTM-CF24. Co-transfection of 
both plasmids, however, resulted in robust production of full-length P-gal protein, which was 
readily detectable using anti-P-gal antiserum (Figure 39). This result complements enzymatic 
activity data suggests that the latter was not due to a complementation by truncated p-gal 
proteins. The Western blot analysis revealed that full-length p-gal protein was made in 293T 
cells by /raws-splicing and furthermore confirmed that the PTMs with long binding domains 
were efficiently spliced. 

[00191] Appropriate repair of P-gal mRNA and synthesis of full-length p-gal protein 

should lead to the production of active enzyme. Indeed, 293T cells co-transfected with 
lacZCF9m and PTM-CF24 were shown to have p-gal activity measured either in situ (Figure 
40A) or in extracts (Figure 40B). This activity was shown to depend on the ^raws-splicing 
between the target pre-mRNA and the PTM. The quantitative in solution assay further 
confirmed the data presented above: PTM-CF22 and PTM-CF24 were 2.9 and 9.3 fold more 
efficient respectively than PTM-CF14. Most impressive, however, were results using 293 cells 
that harbor lacZCF9m as a stable endogenous gene. When these cells were transfected with 
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PTM-CF14 the levels of p-gal activity obtained were barely above background. Transfection 
with PTM-CF24, however, resulted in a considerable level of p-gal activity (Figure 40C). This 
was paralleled by the appearance of full-length p-gal protein. These data demonstrate a sizeable 
increase in the efficiency of /raws-splicing to repair a mutated pre-mRNA. In fact all prior 
reports of repair of endogenous RNA in mammalian cells by either group I ribozymes or trans- 
splicing have been only documented using RT-PCR, an indication of the low level of repair. 
[00192] PTMs with very long binding domains are highly specific. It was shown that a 

secondary structure within the binding domain could enhance specificity of PTMs in HeLa 
nuclear extracts. In order to ascertain the specificity of the fraws-splicing reactions in vivo a 
second target gene was prepared, which could serve as reporter of non-specific reactions. This 
gene, which is referred to as lacZHCGlm, shares the first exon with lacZCF9m. The intron in 
lacZHCGlm is intron 1 of the p-subunit of the human chorionic gonadotropin gene 6 (phCG6) 
and the second exon is exon 2 of the same gene. lacZHCGlm drives the synthesis of a pre- 
mRNA that is spliced correctly to yield a chimeric mRNA that does not encode a full-length p- 
gal (see below). PTM-CF14, -CF22 and -CF24 are not targeted to lacZHCGlm pre-mRNA since 
there is no complementarity between the binding domains in these PTMs and the target gene. 
Any /raws-splicing between these PTMs and lacZHCGlm pre-mRNA is therefore non-specific 
(Figure 41 A). 

[00193] 293T cells were transfected with PTM-CF14, -CF22 or -CF24 and the level of 

non-specific ^raws-splicing was determined by RT-PCR and by in solution P-gal assays. Semi- 
quantitative RT-PCR suggested that PTM-CF24 was significantly less likely than PTM-CF14 to 
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trans-splice with lacZHCGlm pre-mRNA. Measurement of P-gal activity confirmed this; cells 
co-transfected with lacZHCGlm and PTM-CF24 produced 3.7 fold less p-gal than those co- 
transfected with lacZHCGlm and PTM-CF14 (Figure 41C). Based on these data it was 
estimated that PTM-CF24 is 50 times more likely to trans-splice to its target than to a non- 
specific target. A "safety" version of PTM-CF24, sPTM-CF24, did not confer further specificity 
(Figure 41 C). Nonetheless, for PTMs with shorter binding domains a "safety" stem involving 
the binding domain was seen to improve specificity in vivo (Figure 41C). It was concluded from 
these data that the longer binding domains resulted in PTMs that were not only more efficient but 
also more specific. 

[00194] The observation that long binding domains increased the specificity of PTMs 

suggested that very long binding domains (>200nt) could further enhance discrimination. 
Plasmids encoding PTM-CF26 and -CF27, which have binding domains that span 200 nt and 
411 nt respectively, were constructed and co-transfected with lacZHCGlm plasmid. Non- 
specific /ra/w-splicing of these two PTMs was barely detectable with RT-PCR (Figure 4 IB). As 
measured by the P-gal assay PTM-CF26 and -CF27 had minimal non-specific trans-splicing 
activity (Figure 41C). In a specific /raws-splicing reaction with lacZCF9m as measured by the 
solution p-gal assay PTM-CF26 was as active as PTM-CF14 (Figure 4 IB). It was estimated that 
PTM-CF26 is 80 times more likely to trans-splice to the specific target (lacZCF9m) than to a 
non-specific target (lacZHCGlm). Therefore, inclusion of very long binding domains confers to 
these PTMs very high specificity. 
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[00195] The present invention is not to be limited in scope by the specific embodiments 

described herein. Indeed, various modifications of the invention in addition to those described 
herein will become apparent to those skilled in the art from the foregoing description and 
accompanying Figures. Such modifications are intended to fall within the scope of the appended 
claims. Various references are cited herein, the disclosure of which are incorporated by 
reference in their entireties. 
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